Practical Statistics for the Analytical Scientist Table of Contents

Similar documents
FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Data Analysis. with Excel. An introduction for Physical scientists. LesKirkup university of Technology, Sydney CAMBRIDGE UNIVERSITY PRESS

THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

Subject CS1 Actuarial Statistics 1 Core Principles

Transition Passage to Descriptive Statistics 28

Prentice Hall Stats: Modeling the World 2004 (Bock) Correlated to: National Advanced Placement (AP) Statistics Course Outline (Grades 9-12)

HANDBOOK OF APPLICABLE MATHEMATICS

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

REVIEW: Midterm Exam. Spring 2012

Index. Cambridge University Press Data Analysis for Physical Scientists: Featuring Excel Les Kirkup Index More information

Glossary for the Triola Statistics Series

3 Joint Distributions 71

Contents. Acknowledgments. xix

Statistics Toolbox 6. Apply statistical algorithms and probability models

Kumaun University Nainital

Basic Statistical Analysis

INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS

STATISTICS SYLLABUS UNIT I

Learning Objectives for Stat 225

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

Experimental Design and Data Analysis for Biologists

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Institute of Actuaries of India

Chapter 1 Statistical Inference

Objective Experiments Glossary of Statistical Terms

sphericity, 5-29, 5-32 residuals, 7-1 spread and level, 2-17 t test, 1-13 transformations, 2-15 violations, 1-19

AP Statistics Cumulative AP Exam Study Guide

STAT 200 Chapter 1 Looking at Data - Distributions

DESIGN AND ANALYSIS OF EXPERIMENTS Third Edition

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

Exam details. Final Review Session. Things to Review

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Sociology 6Z03 Review I

Micro Syllabus for Statistics (B.Sc. CSIT) program

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

Standard Terminology Relating to Quality and Statistics 1

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound

STATISTICS; An Introductory Analysis. 2nd hidition TARO YAMANE NEW YORK UNIVERSITY A HARPER INTERNATIONAL EDITION

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Statistics Handbook. All statistical tables were computed by the author.

1 Introduction to Minitab

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Measurement And Uncertainty

Application of mathematical, statistical, graphical or symbolic methods to maximize chemical information.

Chapter2 Description of samples and populations. 2.1 Introduction.

Revised: 2/19/09 Unit 1 Pre-Algebra Concepts and Operations Review

Module 1. Identify parts of an expression using vocabulary such as term, equation, inequality

ROSLYN PUBLIC SCHOOLS INTEGRATED ALGEBRA CURRICULUM. Day(s) Topic Textbook Workbook Additional Worksheets

BIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics.

Unit 27 One-Way Analysis of Variance

21 ST CENTURY LEARNING CURRICULUM FRAMEWORK PERFORMANCE RUBRICS FOR MATHEMATICS PRE-CALCULUS

CONTENTS PREFACE XI ABOUT THE AUTHOR XIII. CHAPTER 1 Research Strategies and the Control of Nuisance Variables 1

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

Schedule for a proficiency test

Unit Six Information. EOCT Domain & Weight: Algebra Connections to Statistics and Probability - 15%

Resval. Practical tool for the validation of an analytical method and the quantification of the uncertainty of measurement.

Sociology 6Z03 Review II

Pharmaceutical Experimental Design and Interpretation

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Protocol for the design, conducts and interpretation of collaborative studies (Resolution Oeno 6/2000)

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III)

SPSS Guide For MMI 409

Algebra I. Standard Processes or Content Strand CLE Course Level Expectation SPI State Performance Indicator Check for Understanding

-However, this definition can be expanded to include: biology (biometrics), environmental science (environmetrics), economics (econometrics).

Taguchi Method and Robust Design: Tutorial and Guideline

CIVL 7012/8012. Collection and Analysis of Information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Evaluate algebraic expressions for given values of the variables.

Review of Statistics 101

Introduction to the Design and Analysis of Experiments

Statistics and Measurement Concepts with OpenStat

California. Performance Indicator. Form B Teacher s Guide and Answer Key. Mathematics. Continental Press

Appendix A Summary of Tasks. Appendix Table of Contents

APPENDICES APPENDIX A. STATISTICAL TABLES AND CHARTS 651 APPENDIX B. BIBLIOGRAPHY 677 APPENDIX C. ANSWERS TO SELECTED EXERCISES 679

Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error

Area Formulas. Linear

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

Chapter 3. Measuring data

Chap The McGraw-Hill Companies, Inc. All rights reserved.

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

GLOSSARY. a n + n. a n 1 b + + n. a n r b r + + n C 1. C r. C n

A Unified Approach to Uncertainty for Quality Improvement

University of Texas-Austin - Integration of Computing

Experimental Design and Optimization

STP 420 INTRODUCTION TO APPLIED STATISTICS NOTES

Chapter 1:Descriptive statistics

Chapter 4 Multi-factor Treatment Designs with Multiple Error Terms 93

Error analysis in biology

1. Exploratory Data Analysis

Transcription:

Practical Statistics for the Analytical Scientist Table of Contents Chapter 1 Introduction - Choosing the Correct Statistics 1.1 Introduction 1.2 Choosing the Right Statistical Procedures 1.2.1 Planning Experiments 1.2.2 Representative Test Portions for Large Samples 1.2.3 Reviewing and Checking Data 1.2.4 Reporting Results - Summarising and Describing Data 1.2.5 Decisions about Differences and Limits 1.2.5.1 Comparing Results with Limits 1.2.5.2 Differences Between Sets of Data 1.2.6 Calibrating Instruments 1.2.7 Describing Analytical Method Performance 1.2.8 Analytical Method Validation 1.2.9 Analytical Quality Control 1.2.10 Testing Laboratory Performance Proficiency Testing 1.2.11 Measurement Uncertainty Chapter 2 Graphical Methods 2.1 Some Example Data 2.2 Dot Plots 2.3 Stem-and-Leaf Plots 2.4 Tally Charts 2.5 Histograms 2.6 Frequency Polygon 2.7 Cumulative Distribution 2.8 Box Plots 2.9 Scatter Plots 2.10 Normal Probability Plots Chapter 3 Distributions and Types of Data 3.1 Introduction 3.2 Describing Distributions 3.3 Distributions of Analytical Data 3.3.1 The Normal Distribution 3.3.2 The Lognormal Distribution Page 1 of 10

3.3.3 Poisson and Binomial Distributions 3.4 Distributions Derived from the Normal Distribution 3.4.1 Distribution of Student s t 3.4.2 The Chi-Squared Distribution 3.4.3 The F Distribution 3.5 Other Distributions 3.5.1 Rectangular Distribution (or Uniform Distribution) 3.5.2 Triangular Distribution 3.6 Populations and Samples 3.7 Checking Normality 3.8 Types of Data Chapter 4 Basic Statistical Techniques 4.1 Summarising Data: Descriptive Statistics 4.1.1 Introduction 4.1.2 Counts, Frequencies and Degrees of Freedom 4.1.3 Measures of Location 4.1.3.1 Arithmetic Mean 4.1.3.2 Median 4.1.3.3 Mode 4.1.4 Measures of Dispersion 4.1.4.1 Variance 4.1.4.2 Standard Deviation 4.1.4.3 Standard Deviation of the Mean 4.1.4.4 Relative Standard Deviation and Coefficient of Variation 4.1.4.5 Range 4.1.4.6 Quartiles and the Inter-Quartile Range 4.1.5 Skewness 4.1.6 Kurtosis 4.2 Significance Testing 4.2.1 Introduction 4.2.2 A Procedure for Significance Testing 4.2.2.1 Steps in Significance Testing 4.2.2.2 Stating the Hypotheses 4.2.2.3 Checking the Distribution 4.2.2.4 Selecting the Appropriate Test 4.2.2.5 Choosing the Level of Significance 4.2.2.6 Choosing the Number of Tails Page 2 of 10

4.2.2.7 Calculating the Test Statistic 4.2.2.8 Obtaining the Critical Value 4.2.2.9 Compare the Test Statistic with the Critical Value 4.2.2.10 Using P-Values 4.2.3 Tests on One or Two Mean Values - The Student t-test 4.2.3.1 Comparing the Mean with a Stated Value (One-Sample Test) 4.2.3.2 Comparing the Means of Two Independent Sets of Data (Two-Sample Test) 4.2.3.3 Paired Comparisons 4.2.4 Comparing Two Observed Standard Deviations or Variances - The F-test 4.2.5 Comparing Observed Standard Deviation or Variance with an Expected or Required Standard Deviation Using Tables for the F Distribution 4.3 Confidence Intervals for Mean Values Chapter 5 Outliers in Analytical Data 5.1 Introduction 5.2 Outlier Tests 5.2.1 The Purpose of Outlier Tests 5.2.2 Action on Detecting Outliers 5.2.3 The Dixon Tests 5.2.3.1 The Dixon Q Test 5.2.3.2 Extended Tests 5.2.4 The Grubbs Tests 5.2.5 The Cochran test 5.3 Robust Statistics 5.3.1 Introduction 5.3.2 Robust Estimators for Population Means 5.3.2.1 Median and Trimmed Mean 5.3.2.2 The A15 Estimate 5.3.2.3 Huber s Proposal 2 5.3.3 Robust Estimates of Standard Deviation 5.3.3.1 Median Absolute Deviation 5.3.3.2 MAD E 5.3.3.3 Mean Absolute Deviation and smad 5.3.3.4 Huber s Proposal 2 5.4 When to use Robust Estimators Chapter 6 Analysis of Variance 6.1 Introduction Page 3 of 10

6.2 Interpretation of ANOVA Tables 6.2.1 Anatomy of an ANOVA Table 6.2.1.1 Rows in the ANOVA Table 6.2.1.2 Columns in the ANOVA Table 6.2.2 Interpretation of ANOVA Results 6.3 One-Way ANOVA 6.3.1 Data for One-Way ANOVA 6.3.2 Calculations for One-Way ANOVA 6.3.2.1 Calculating the ANOVA Table 6.3.2.2 Calculating Variance Components for One-Way ANOVA 6.4 Two-Factor ANOVA 6.4.1 Applications of Two-Factor ANOVA 6.5 Two-Factor ANOVA: Cross-classification 6.5.1 Two-Factor ANOVA for Cross-classification without Replication 6.5.1.1 Data for Two-Factor ANOVA without Replication 6.5.1.2 Results Table for Two-Factor ANOVA without Replication 6.5.1.3 F-Tests in Two-Factor ANOVA without Replication 6.5.2 Two-Factor ANOVA for Cross-classification with Replication 6.5.2.1 Data for Two-Factor ANOVA with Replication 6.5.2.2 The Concept of Interaction 6.5.2.3 Results Table for Two-Factor ANOVA with Replication 6.5.2.4 F-Tests in Two-Factor ANOVA with Replication 6.5.2.5 Creating and Interpreting Interaction Plots 6.5.2.6 Random Effects and Two-Factor ANOVA with Replication 6.6 Two-Factor ANOVA for Nested Designs (Hierarchical Classification) 6.6.1 Data for Two-Factor ANOVA for Nested Designs 6.6.2 Results Table for Two-Factor ANOVA for Nested Designs 6.6.3 Variance Components 6.6.4 F-Tests for Two-Factor ANOVA on Nested Designs 6.7 Checking Assumptions for ANOVA 6.7.1 Checking Normality 6.7.2 Checking Homogeneity of Variance Levene s Test 6.8 Missing Data in ANOVA Appendix: Manual Calculations for ANOVA Chapter 7 Regression 7.1 Linear Regression 7.1.1 Introduction to Linear Regression 7.1.2 Assumptions in Linear Regression Page 4 of 10

7.1.3 Visual Examination of Regression Data 7.1.4 Calculating the Gradient and Intercept 7.1.5 Inspecting the Residuals 7.1.6 The Correlation Coefficient 7.1.6.1 Calculating the Correlation Coefficient 7.1.6.2 Interpretation of the Correlation Coefficient 7.1.7 Uncertainty in Predicted Values of x 7.1.8 Interpreting Regression Statistics from Software 7.1.8.1 Statistics Describing Gradient and Intercept Estimates 7.1.8.2 ANOVA Tables for Linear Regression 7.1.9 Testing for Non-Linearity 7.1.9.1 Inspection of the Residuals 7.1.9.2 F-Test of Residual Standard Deviation Against Repeatability Standard Deviation 7.1.9.3 ANOVA Applied to Residuals 7.1.9.4 Testing for Significant Higher-Order Terms 7.1.10 Designing Linear Calibration Experiments 7.1.10.1 Minimising Prediction Uncertainty 7.1.10.2 Replication in Calibration 7.1.10.3 Summary of Linear Calibration Design Recommendations 7.1.10.4 Planning Linearity Checks 7.1.11 Two Common Mistakes 7.2 Polynomial Regression 7.2.1 Polynomial Curves and Non-Linearity 7.2.2 Fitting a Quadratic (Second Order Polynomial) 7.2.3 Using Polynomial Regression for Checking Linearity Appendix: Calculations for Polynomial Regression Chapter 8 Designing Effective Experiments 8.1 Some New Terminology 8.2 Planning for Statistical Analysis 8.2.1 Measuring the Right Effect 8.2.2 Single-Versus Multi-Factor Experiments 8.3 General Principles 8.4 Basic Experimental Designs for Analytical Science 8.4.1 Simple Replication 8.4.2 Linear Calibration Designs 8.4.3 Nested Designs 8.4.4 Factorial Designs Page 5 of 10

8.5 Number of samples 8.5.1 Number of Samples for a Desired Standard Deviation of the Mean 8.5.2 Number of Samples for a Given Confidence Interval Width about a Mean Value 8.5.3 Number of Samples for a Desired t-test Power 8.5.4 Number of Observations for Other Applications and Tests 8.6 Controlling Nuisance Effects 8.6.1 Randomisation 8.6.1.1 The Advantage of Randomisation 8.6.1.2 Implementing Randomisation 8.6.1.3 Limitations of Randomisation 8.6.2 Pairing 8.6.3 Blocked Designs 8.6.3.1 Introduction to Blocking 8.6.3.2 Analysing Blocked Experiments 8.6.4 Latin Square and Related Designs 8.6.5 Validating Experimental Designs 8.7 Advanced Experimental Designs 8.7.1 Fractional Factorial Designs 8.7.2 Optimisation Designs 8.7.2.1 Optimisation and Response Surface Modelling 8.7.2.2 Star Design 8.7.2.3 Central Composite Designs 8.7.2.4 Box-Behnken Design 8.7.2.5 Data Processing for Response Surface Designs 8.7.3 Mixture Designs 8.7.4 D-Optimal Designs 8.7.5 Advanced Blocking Strategies 8.7.5.1 Simple Blocking Extended to Multi-Factor Experiments 8.7.5.2 Confounding with High Order Interaction 8.7.5.3 Incomplete Block Designs Appendix: Calculations for a Simple Blocked Experiment and Further Reading Chapter 9 Validation and Method Performance 9.1 Introduction 9.2 Assessing Precision 9.2.1 Types of Precision Estimate 9.2.1.1 Repeatability 9.2.1.2 Intermediate Precision Page 6 of 10

9.2.1.3 Reproducibility 9.2.2 Experimental Designs for Evaluating Precision 9.2.2.1 Simple Replication Studies 9.2.2.2 Nested Design 9.2.3 Precision Limits 9.2.4 Statistical Evaluation of Precision Estimates 9.2.4.1 Comparison with the Precision of an Existing Method 9.2.4.2 Comparison with a Specified Precision Value 9.3 Assessing Bias 9.3.1 Statistical Evaluation of Bias Data 9.3.1.1 Comparison with a Reference Value 9.3.1.2 Comparison with an Existing Method 9.4 Accuracy 9.5 Capability of Detection 9.5.1 Limit of Detection 9.5.2 Limit of Quantitation 9.6 Linearity and Working Range 9.7 Ruggedness 9.7.1 Planning a Ruggedness Study 9.7.2 Evaluating Data from a Ruggedness Study Chapter 10 Measurement Uncertainty 10.1 Definitions and Terminology 10.2 Principles of the ISO Guide to the Expression of Uncertainty in Measurement 10.2.1 Steps in Uncertainty Assessment 10.2.2 Specifying the Measurand 10.2.3 Identifying Sources of Uncertainty The Measurement Equation 10.2.4 Obtain Standard Uncertainties for Each Source of Uncertainty 10.2.5 Convert Uncertainties in Influence Quantities to Uncertainties in the Analytical Result 10.2.6 Combining Standard Uncertainties and Propagation of Uncertainty 10.2.7 Reporting Measurement Uncertainty 10.3 Practical Implementation 10.3.1 Using a Spreadsheet to Calculate Combined Uncertainty 10.3.2 Alternative Approaches to Uncertainty Evaluation - Using Reproducibility Data 10.4 A Basic Methodology for Uncertainty Estimation in Analytical Science Page 7 of 10

Chapter 11 Analytical Quality Control 11.1 Introduction 11.2 Shewhart Charts 11.2.1 Constructing a Shewhart chart 11.2.2 Shewhart Decision Rules 11.3 CuSum Charts 11.3.1 Constructing a CuSum Chart 11.3.2 CuSum Decision Rules Chapter 12 Proficiency Testing 12.1 Introduction 12.2 Calculation of Common Proficiency Testing Scores 12.2.1 Setting the Assigned Value and the Standard Deviation for Proficiency Assessment 12.2.1.1 Assigned Value 12.2.1.2 Standard Deviation for Proficiency Assessment 12.2.2 z-score and Q-Score 12.3 Interpreting and Acting on Proficiency Test Results 12.4 Monitoring Laboratory Performance Cumulative Scores 12.5 Ranking Laboratories in Proficiency Tests Chapter 13 Simple Sampling Strategies 13.1 Introduction 13.2 Nomenclature 13.3 Principles of Sampling 13.3.1 Randomisation 13.3.2 Representative Samples 13.3.3 Composite Samples 13.4 Sampling Strategies 13.4.1 Simple Random Sampling 13.4.1.1 Description 13.4.1.2 Advantages and Disadvantages 13.4.1.3 Statistical Treatment 13.4.2 Stratified Random Sampling 13.4.2.1 Description 13.4.2.2 Advantages and Disadvantages Page 8 of 10

13.4.2.3 Statistical Treatment 13.4.3 Systematic Sampling 13.4.3.1 Description 13.4.3.2 Advantages and Disadvantages 13.4.3.3 Statistical Treatment 13.4.4 Cluster and Multi-Stage Sampling 13.4.4.1 Description 13.4.4.2 Advantages and Disadvantages 13.4.4.3 Statistical Treatment 13.4.5 Quota Sampling 13.4.5.1 Description 13.4.5.2 Advantages and Disadvantages 13.4.5.3 Statistical Treatment 13.4.6 Sequential Sampling 13.4.6.1 Description 13.4.6.2 Advantages and Statistical Treatment 13.4.7 Judgement Sampling 13.4.7.1 Description 13.4.7.2 Advantages and Disadvantages 13.4.7.3 Statistical Treatment 13.4.8 Convenience Sampling 13.4.8.1 Description 13.4.8.2 Advantages and Disadvantages 13.4.8.3 Statistical Treatment 13.4.9 Sampling in Two Dimensions 13.4.9.1 Description 13.4.9.2 Statistical Treatment 13.5 Uncertainties Associated with Sampling 13.6 Conclusions and Further Reading Appendix A Statistical Tables Table 1a Critical values for the Dixon test (95% confidence, two-tailed) Table 1b Critical values for the Dixon test (99% confidence, two-tailed) Table 2 Critical values for the Grubbs tests Table 3a Critical values for the Cochran test (95% confidence) Table 3b Critical values for the Cochran test (99% confidence) Table 4 Critical values for the Student t-test Table 5a Critical values for the F-test (95% confidence) Page 9 of 10

Table 5b Critical values for the F-test (97.5% confidence) Table 5c Critical values for the F-test (99% confidence) Table 5d Critical values for the F-test (99.5% confidence) Appendix B Symbols, Abbreviations and Notation Appendix C Questions and Solutions Page 10 of 10

http://www.springer.com/978-0-85404-131-2