ANALYTICAL CHEMISTRY - CLUTCH 1E CH STATISTICS, QUALITY ASSURANCE AND CALIBRATION METHODS

Similar documents
Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Topic 2 Measurement and Calculations in Chemistry

Statistics: Error (Chpt. 5)

Statistical Analysis of Chemical Data Chapter 4

Lecture 11. Data Description Estimation

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

Elementary Statistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

How to Describe Accuracy

Objective A: Mean, Median and Mode Three measures of central of tendency: the mean, the median, and the mode.

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Chapter 9. Hypothesis testing. 9.1 Introduction

INTERVAL ESTIMATION OF THE DIFFERENCE BETWEEN TWO POPULATION PARAMETERS

A is one of the categories into which qualitative data can be classified.

Measures of Central Tendency and their dispersion and applications. Acknowledgement: Dr Muslima Ejaz

CBA4 is live in practice mode this week exam mode from Saturday!

Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples.

Continuous random variables

Int Math 1 Statistic and Probability. Name:

Practice problems from chapters 2 and 3

Chapter 3: The Normal Distributions

EQ: What is a normal distribution?

Determining the Spread of a Distribution

Chem 321 Lecture 4 - Experimental Errors and Statistics 9/5/13

Determining the Spread of a Distribution Variance & Standard Deviation

Determining the Spread of a Distribution

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Data analysis and Geostatistics - lecture VI

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal)

Chem 321 Lecture 5 - Experimental Errors and Statistics 9/10/13

Midterm 1 and 2 results

Chapter 2: Tools for Exploring Univariate Data

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound

CHAPTER 1. Introduction

SMAM 314 Exam 42 Name

CIVL 7012/8012. Collection and Analysis of Information

GRACEY/STATISTICS CH. 3. CHAPTER PROBLEM Do women really talk more than men? Science, Vol. 317, No. 5834). The study

Estimating a population mean

Introduction to Statistics

QUANTITATIVE DATA. UNIVARIATE DATA data for one variable

11. The Normal distributions

Inference for Distributions Inference for the Mean of a Population

6 Single Sample Methods for a Location Parameter

Lecture 3. - all digits that are certain plus one which contains some uncertainty are said to be significant figures

Chapter 1 - Lecture 3 Measures of Location

Lecture # 31. Questions of Marks 3. Question: Solution:

F78SC2 Notes 2 RJRC. If the interest rate is 5%, we substitute x = 0.05 in the formula. This gives

Lecture Notes for BUSINESS STATISTICS - BMGT 571. Chapters 1 through 6. Professor Ahmadi, Ph.D. Department of Management

The Empirical Rule, z-scores, and the Rare Event Approach

are the objects described by a set of data. They may be people, animals or things.

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Sampling, Frequency Distributions, and Graphs (12.1)

Chapter 6: SAMPLING DISTRIBUTIONS

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Comparison of Two Population Means

What is statistics? Statistics is the science of: Collecting information. Organizing and summarizing the information collected

MATH 1150 Chapter 2 Notation and Terminology

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Data Analysis II. CU- Boulder CHEM-4181 Instrumental Analysis Laboratory. Prof. Jose-Luis Jimenez Spring 2007

The Components of a Statistical Hypothesis Testing Problem

Chapter 7 Class Notes Comparison of Two Independent Samples

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

Measures of Dispersion

WELCOME!! LABORATORY MATH PERCENT CONCENTRATION. Things to do ASAP: Concepts to deal with:

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 3.1- #

The empirical ( ) rule

STAT 155 Introductory Statistics. Lecture 6: The Normal Distributions (II)

download instant at

The number of daily sleep hours can be used to determine the amount of available study hours.

TOPIC: Descriptive Statistics Single Variable

a) The runner completes his next 1500 meter race in under 4 minutes: <

Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight?

Experimental design. Matti Hotokka Department of Physical Chemistry Åbo Akademi University

Standard normal distribution. t-distribution, (df=5) t-distribution, (df=2) PDF created with pdffactory Pro trial version

Chapter 6 Continuous Probability Distributions

Analytical Chemistry. Course Philosophy

MAT2377. Rafa l Kulik. Version 2015/November/23. Rafa l Kulik

Statistical Methods for Astronomy

Chem 4331 Name : Final Exam 2008

STAT Chapter 8: Hypothesis Tests

OPIM 303, Managerial Statistics H Guy Williams, 2006

Lesson 5.4: The Normal Distribution, page 251

Data Analysis and Statistical Methods Statistics 651

MATH 3200 PROBABILITY AND STATISTICS M3200SP081.1

Looking at data: distributions - Density curves and Normal distributions. Copyright Brigitte Baldi 2005 Modified by R. Gordon 2009.

Measures of Central Tendency

Chapter 7. Practice Exam Questions and Solutions for Final Exam, Spring 2009 Statistics 301, Professor Wardrop

Chapter 2. Mean and Standard Deviation

Estimation and Confidence Intervals

An inferential procedure to use sample data to understand a population Procedures

Resistant Measure - A statistic that is not affected very much by extreme observations.

4.1 Hypothesis Testing

Exercises from Chapter 3, Section 1

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Transcription:

!! www.clutchprep.com

CONCEPT: MEAN EVALUATION The measures how close data results are in relation to the mean or average value. s = i (x i x) n 1 = Individual Measurement = Average or Mean = variance = Number of Measurements = Degrees of Freedom = Relative Standard Deviation (Coefficient of Variation) EXAMPLE: Data below gives the volumes obtained by a chemist from the use of a pipet. Determine the standard deviation. 4.9 ml, 5.0 ml, 4.8 ml, 4.6 ml, 4.6 ml, 4.3 ml Volume (xi) Difference from the mean (x i x) Difference from the mean squared (x i x) i (x i x) Page

CONCEPT: THE GAUSSIAN DISTRIBUTION Performing an experiment numerous times with no systematic error results in a smooth curve called the Gaussian Distribution. f(x) µ σ x f(x) = e =.7188 1 σ π e (x µ) /σ In terms of the Gaussian Distribution curve, increasing the number of measurements in the experiment: Changes the mean,, to mu, to represent the population mean. Changes the standard deviation,, to sigma,, to represent the population standard deviation. The shape of the Gaussian Distribution curve can occur by: Changing, which will shift the distribution curve to the left or right. Changing, which will increase or decrease the broadness of the distribution curve. Normally each distributed variable has its own mean and standard deviation. The standard normal distribution simplifies this by setting the mean at and standard deviation in units of. f(x) Standard Normal Distribution Formula y = e z / π z = Abscissa (Z-Score) Value mean Standard Deviation = X µ σ of the data falls between the -1 to +1 area Standard Deviation Z-Score Cumulative % σ -3 - -1 0 +1 + +3-3 - -1 0 +1 + +3 0.1%.3% 15.9% 50% 84.1% 97.7% 99.9% x of the data falls between the - to + area of the data falls between the -3 to +3 area Page 3

PRACTICE: THE GAUSSIAN DISTRIBUTION & Z-TABLES The use of Z-Tables is essential in the determination of probabilities. Probability -3 z 0 +3 Probability -3 0 z +3 Page 4

PRACTICE: THE GAUSSIAN DISTRIBUTION & Z-TABLES CALCULATIONS 1 EXAMPLE 1: Suppose there are 100 students in your analytical lecture and at the end of the semester the class average is an 80 with a standard deviation of 5.3, determine the distribution and probability of grades based on your understanding of the Gaussian distribution curve. f(x) -3 - -1 0 +1 + +3 x Standard Deviation EXAMPLE : From EXAMPLE 1, determine the percentage of final grades that would lie below 71. f(x) -3 - -1 0 +1 + +3 x Standard Deviation PRACTICE: From EXAMPLE 1, determine the percentage of final grades that would lie between 88 to 9. f(x) -3 - -1 0 +1 + +3 x Standard Deviation Page 5

CONCEPT: CONFIDENCE INTERVALS A confidence interval is a specific interval estimate of a parameter determined by using data obtained from a sample. For example a 95% confidence interval means we are 95% confident the mean lies within a given interval. = Student's t = standard deviation Confidence int erval = x ± ts n = # of measurements = average or mean A Student s t is a statistical table used in our understanding of confidence intervals and in the comparative data from different experiments. EXAMPLE: Construct a 95% confidence interval for an experiment that found the mean temperature for a given city in July as 103.5 o C with a standard deviation of 1.8 from 10 measurements. Page 6

PRACTICE: CONFIDENCE INTERVALS CALCULATIONS 1 EXAMPLE 1: The barium content of a metal ore was analyzed several times by a percent composition process. Calculate the mean, median and mode. 0.010 0.011 0.004 0.011 EXAMPLE : From EXAMPLE 1, calculate the standard deviation. PRACTICE: From the examples given above, find the 90% confidence interval. Page 7

CONCEPT: T-TEST The t-test is used to test the of two populations, one of which could be a standard. In order to test the similarities and differences between these two populations you can utilize the t-score. Use the t score formula when we don t know the population standard deviation and have a sample size less than. t = x µ 0 s n = sample average = population average = sample standard deviation = number of samples The larger the t-score then the more the populations. The smaller the t-score then the more the populations. t-calculated (for equal variance) t Calculated = x1 x s pooled n s pooled = s 1 ( 1)+s (n 1) + n n 1 + n t-calculated (for unequal variance) Degrees of freedom = + n t calculated = x1 x s 1 + s n Degrees of freedom = s 1 + s n s 1 s 1 + n n 1 t-calculated (paired data) t Calculated = d s n s = Σ (d i d) n 1 Page 8

PRACTICE: T-TEST CALCULATIONS 1 EXAMPLE: A student wishing to calculate the amount of arsenic in cigarettes decides to run two separate methods in her analysis. The results (shown in ppm) are shown below: Sample Method 1 Method 1 110.5 104.7 93.1 95.8 3 63.0 71. 4 7.3 69.9 5 11.6 118.7 Is there a significant difference between the two analytical methods under a 95% confidence interval? Page 9

PRACTICE: T-TEST CALCULATIONS EXAMPLE: You want to determine if concentrations of hydrocarbons in seawater measured by fluorescence are significantly different than concentrations measured by a second method, specifically based on the use of gas chromatography/flame ionization detection (GC-FID). You measure the concentration of a certified standard reference material (100.0 µm) with both methods seven (n=7) times. Specifically, you first measure each sample by fluorescence, and then measure the same sample by GC-FID. The concentrations determined by the two methods are shown below. [fluorene (µm)] Sample Fluorescence GC-FID 1 100. 101.1 100.9 100.5 3 99.9 100. 4 100.1 100. 5 100.1 99.8 6 101.1 100.7 7 100.0 99.9 Calculate the appropriate t-statistic to compare the two sets of measurements. Page 10

PRACTICE: T-TEST CALCULATIONS 3 EXAMPLE: A sample of size n = 100 produced the sample mean of 16. Assuming the population deviation is 3, compute a 95% confidence interval for the population mean. PRACTICE: The average height of the US male is approximately 68 inches. What is the probability of selecting a group of males with average height of 7 inches or greater with a standard deviation of 5 inches? Probability -3 0 z +3 Page 11

CONCEPT: F-TEST The f-test is used to test the of two populations, which recall is equal to the standard deviation. FCalculated represents the quotient of the squares of the standard deviations: F Calculated = s 1 s When calculating the f quotient always set the larger standard deviation as the numerator so that F 1 If FCalculated FTable then the difference will not be significant. t Calculated = x1 x s pooled n s pooled = s 1 ( 1)+s (n 1) + n + n If FCalculated FTable then the difference will be significant. t calculated = x1 x s 1 + s n s 1 + s n s 1 s 1 + n n 1 Degrees of Degrees of Freedom for s 1 Freedom for s 3 4 5 6 7 8 9 10 1 15 0 30 Page 1

PRACTICE: F-TEST CALCULATIONS 1 EXAMPLE 1: In the process of assessing responsibility for an oil spill, two possible suspects are identified. To differentiate between the two samples of oil, the ratio of the concentration for two polyaromatic hydrocarbons is measured using fluorescence spectroscopy. These values are then compared to the sample obtained from the body of water: Mean Standard Deviation # Samples Suspect 1.31 0.073 4 Suspect.67 0.09 5 Sample.45 0.088 6 From the above results, should there be a concern that any combination of the standard deviation values demonstrates a significant difference? EXAMPLE : Can either (or both) of the suspects be eliminated based on the results of the analysis at the 99% confidence interval? Page 13

PRACTICE: F-TEST CALCULATIONS EXAMPLE 1: You are measuring the effects of a toxic compound on an enzyme. You expose five (test tubes of cells to 100 µl of a 5 ppm aqueous solution of the toxic compound and mark them as treated, and expose five test tubes of cells to an equal volume of only water and mark them as untreated. You then measure the enzyme activity of cells in each test tube; enzyme activity is in units of µmol/minute. The following are the measurements of enzyme activity: Activity (Treated) Activity (Untreated) Tube (µmol/min) Tube (µmol/min) 1 3.5 1 5.84 3.98 6.59 3 3.79 3 5.97 4 4.15 4 6.5 5 4.04 5 6.10 Average: 3.84 Average: 6.15 Standard Standard Deviation: 0.36 Deviation: 0.9 Is the variance of the measured enzyme activity of cells exposed to the toxic compound equal to that of cells exposed to water alone? EXAMPLE : Is the average enzyme activity measured for cells exposed to the toxic compound significantly different (at 95% confidence level) than that measured for cells exposed to water alone? Page 14

CONCEPT: DETECTION OF GROSS ERRORS Grubbs test is used to detect a single outlier in a single variable data set that follows some type of normal distribution. Grubbs Test G Calculated = Questionable value x s Number G Table or G Critical of Observations (90% Confidence) (95% Confidence) (99% Confidence) 3 1.153 1.154 1.155 4 1.463 1.481 1.496 5 1.671 1.715 1.764 G Table < G Calculated G Table > G Calculated Disregard Value Hold Value 6 1.8 1.887 1.973 7 1.938.00.139 8.03.17.74 9.110.15.387 10.176.90.48 The Q-Test is another method used in finding outliers in very small, normally distributed, data sets. The number of measurements is normally between 3 to 7 values. Q-Test Q Calculated = Gap Range = x 1 x n+1 r x 1 = x n+1 = r = range (largest smallest value in data set) Q Table < Q Calculated Disregard Value Number Q Table or Q Critical of Observations (90% Confidence) (95% Confidence) (99% Confidence) 3 0.941 0.970 0.994 4 0.765 0.89 0.96 5 0.64 0.710 0.81 6 0.560 0.65 0.740 7 0.507 0.568 0.680 8 0.468 0.56 0.634 Q Table > Q Calculated Retain Value 9 0.437 0.493 0.598 10 0.41 0.466 0.568 Page 15

PRACTICE: DETECTION OF GROSS ERRORS CALCULATIONS 1 EXAMPLE 1: Wishing to measure the amount of caffeine in a cup of coffee you pour ten cups. From the data provided perform a Q-test to determine if the outlier can be retained or disregarded. Caffeine per cup of coffee Cup of Coffee 1 3 4 5 6 7 8 9 10 ppm of coffee 81 83 78 8 7 79 77 81 8 78 EXAMPLE : White blood cells are the defending cells of the human immune system and fight against infectious diseases. Provided below is the normal white blood cell counts for a healthy adult woman. Determine if the current white blood cell count is reasonable by Grubbs test. "Normal" Days Today 5.1 10 6 cells /µl 5.4 10 6 cells /µl 4.9 10 6 cells /µl 5. 10 6 cells /µl 5.3 10 6 cells /µl 5.0 10 6 cells /µl 6.1 10 6 cells /µl Page 16