Chapter 4. Probability and Statistics. Probability and Statistics

Size: px
Start display at page:

Download "Chapter 4. Probability and Statistics. Probability and Statistics"

Transcription

1 Chapter 4 Probability and Statistics Figliola and Beasley, (999) Probability and Statistics Engineering measurements taken repeatedly under seemingly ideal conditions will normally show variability. Measurement system Resolution Repeatability Measurement procedure and technique Repeatability Probability and Statistics Measured variable Temporal variation Spatial variation We want:. A single representative value that best characterizes the average of the data set.. A measure of the variation in a measured data set.

2 True Value True value is represented by X' = X ± ux X = most probable estimate of X u x = confidence interval at probability P% which is based on estimates of precision and bias error. Probability Density Function Central Tendency says that there is one central value about which all other values tend to be scattered. Probability Density the frequency with which the measured variable takes on a value within a given interval. The region where observations tend to gather is around the central value. Plotting Histograms The abscissa is divided in K small intervals between the minimum and maximum values. The abscissa will be divided between the maximum and minimum measured values of x into K small intervals. Let the number of times, nj, that a measured value assumes a value within an interval defined by x-δx x < x+δx be plotted on the ordinate. For small, K should be chosen so that n j 5 for at least one interval.

3 Plotting Histograms For >40 ; K=.87(-) The histogram displays the tendency and density. If the y-axis is normalized by dividing n j /, a frequency distribution results. K=.87(-) =7 Cell range: min = 0.68 ; max =.34 δ x = ( )/7 = 0.0 x-δx x < x+δx Cell = *δx = 0.0 ote: total frequency equals 00% ote: total occurrence equals Probability Density Function P(x) comes from the frequency distribution Px ( ) = lim ( nj) / ( ( δx)), δ x 0 - Defines the probability that a measured variable might assume a particular value on any given observation, and also provides the central tendency - The shape depends on the variable in consideration and its natural circumstances/processes. - Plot histograms compare to common distribution and then fit the parameter. - Unifit is a good PC-based distribution fitting software. 3

4 Infinite Statistics The most common distribution is the normal or Gaussian distribution. This predicts that the distribution will be evenly distributed around the central tendency. ( bell curve ) The operating conditions that are held fixed are length, temperature, pressure, and velocity. Gaussian Distribution Pdf for Gaussian: / px ( ) = / ( v( π) exp [( / )(( x x') / σ )] x = true mean of x ; σ = true variance of x To find/predict the probability that a future measurement falls within some interval. Probability P(x) is the area under the curve between X ±δx on p(x) 4

5 Gaussian Distribution Simplification of integration: Px ( ' x x x x x ' x ) ' + δ δ + δ = pxdx ( ) x' δx P(x)=probability p(x)=pdf if z =(x -x )/σ and β=(x-x )/σ dx=σdβ and P ( z z ) /( ) z / z e β / β = π d β β=standardized normal variant ; z =interval on p(x) Table 4.3 Solutions given for this in table 4.3 For z=, 68.7% of observations within standard deviation of x For z=, 95.45% For z=3, 99.73% The values of z in table 4.3 can be used to predict probability of a unique value occurring in an infinite data set. The Z Table 5

6 Finite Statistics When <<, we do not have true representation of the population. We have finite statistics which describe the sample and estimate the population. Sample mean (probable estimate of true mean) x = / xi i= Sample variance (measurement of precision) x i= s = /( ) ( xi x) Finite Statistics con t: Deviation of x i : x x Sample standard deviation: sx = s i Degree of freedom: the number of samples less the central tendency measurement (-) x t-estimate For a finite data set, we use the t-estimate instead of z, which was used in the infinite. One can state that ± tvps xi = x ± tv, psx (P%), x represents the precision interval P% = probability v = degrees of freedom x = sample mean Table 4.4 gives t distribution 6

7 Standard Deviation of Means If we measure a variable times under fixed conditions, and replicate this M times, we will end up with slightly different means for each replication. It can be shown that regardless of the form of the pdf of the individualized replication, the mean values themselves will be normally distributed. Variation depends on. The sample variance s x. The sample size sx = sx/ / Difference increases as s x increases and decreases with / The expected variance of means can be estimated by a single finite data set Pooled Statistics Samples that are grouped in a manner so as to determine a common set of statistics are called pooled. If we have M replicates of variable x, with repeated measurements producing data set x i,j ; i = to ; j = to M 7

8 Pooled mean of x: x = / M xij j= i= Pooled standard deviation of x: (with v=m(-) degrees of freedom) Pooled standard deviation of means: M sx = [ /( M( )) ( xij xj) ] = M /M sxj j= M j= i= sx = sx /( M) / Chi Squared Distribution Estimates the precision by which s x predicts σ. s x = sample variance ; σ = population variance If we plotted s x for many sets having samples each, we would generate the pdf for P(χ ) (chi-squared) For a normal distribution chi-squared χ = ν(s / σ ) x ν =- 8

9 Precision Interval in Sample Variance The precision interval for the sample variance can be formulated by the probability statement: P( χ χ α χ α ) = α / / with a probability of P(χ)= - α ; α = level of significance Combining: [ x / α / x / α / ] For 95% precision interval by which s x estimates σ : νs x χ σ νs /. 05 < < x / χ. 975 P νs χ σ νs χ = α Precision Interval in Sample Variance The χ distribution estimates the discrepancy expected as a result of random chance. Values for χ α are tabulated in Table 4.5 as a function of the degrees of freedom. The P(χ ) value equals the area under p(χ ) as measured from the left, and the α value is the area as measured from the right. The total area under p(χ ) is equal to unity. 9

10 Values for χ α Goodness of Fit Test We can use a chi-squared test to determine how good of a fit our selected pdf represents the actual distribution of data. The χ test gives us the measure of error between the variation in the data set and variation predicted by the assumed pdf. Construct histogram of data and histogram from predicted pdf, where n j is actual and n j is predicted number of occurrences per cell. Then: k χ = ( nj n' j) / nj j= For given d.o.f. the better fit gives lower χ χ α Goodness of Fit Test The table, given in the previous slide, can be interpreted as a measure of the discrepancy expected as a result of random chance. For example, a value for α of 0.95 implies that 95% of the discrepancy between the histogram and the assumed distribution is due to random variation only. 0

11 Goodness of Fit Test With P(χ ) = - α, this leaves only 5% of the discrepancy caused by a systematic tendency, such as different distribution. In general, a P(χ ) < 0.05 confers a very strong measure of a good fit to the assumed distribution, an unequivocal result. Regression Analysis Regression analysis is used to establish a functional relationship between the independent and dependent variables. It is assumed that the relationship can be described by either a polynomial or a fourier series. It is also assumed that the variation in the dependent variable is normally distributed about a fixed value of the independent variable. Regression Analysis Such behavior is illustrated in the previous slide by considering the dependent variable y i,j consisting of measurements, i =,,,, of y at each of n values of independent variable, x j, j =,,, n. This behavior is most common during calibrations, where the input to the measuring system, x, is held nominally fixed while the measurement of y occurs.

12 Regression Analysis Repeated measurements of y will yield a normal distribution with variance S y( xj), about some mean value, y( xj). mth Order Polynomials (least squares regression) y i =a 0 +a x+a x + +a m x m gives the estimate of y i at independent variable value χ I -x. The standard deviation of the deviation of each point is given by: syx = (( ( yi yci) / ν) / i= d.o.f: ν= - (m+) ; = observations, m = order

13 Linear Polynomials For linear polynomials, a correlation coefficient, r, can be found by: r = syx ( ) s y Linear Polynomials The correlation coefficient represents a quantitative measure of the linear association between x and y. It is bounded by ±, which represents prefect correlation; the sign indicates that y increases or decreases with x. For ±0.9 < r < ±, a linear regression can be considered as a reliable relation between y and x. Variance Sy = /( ) ( yi y) i= For ±0.4 < r ± is considered a good fit between x and y. r is often used, but these are not true measures of the precision of y i. S yx should be used for precision. 3

14 Data Outlier Detection It is not uncommon for a data set to contain one or more spurious data points that appear to be out of range of expected trends. Data that lie outside the probability of normal variation can bias the sample mean and variance. Statistical methods and histograms are helpful to identify outliers. If removed, data needs to be re-evaluated for stats. Method Method (Three Sigma Test) Use area under Pdf to find basis for outlier detection Calculate data set statistics Discard all points outside x± tν99. 8Sx Good for n > 0, easy to program Method Method (modified three sigma test for large data sets) Estimate the sample mean and standard deviation Compute modified z variable for each data point: z0 = ( xi x)/ sx Probability that x lies outside one-sided range from 0 to z 0 is 0.5-P(z 0 ), where P(z 0 ) is found from the one-sided z chart. For data points, if [0.5-P(z 0 )] 0. data is outlier. Example 4. 4

15 umber of Measurements Required How many observations are required to estimate the true mean x, with acceptable precision? Earlier we saw that precision interval is a quantified measure of precision error in the estimate of true mean. / x' = x± τ v, psx sx = sx/ Based on sample mean and its precision interval: / x' = x± tv, 95sx/ umber of Measurements Required: 95% confidence interval: CI =± tv, 95sx/ / (95%) To evaluate, we must have an estimate of S x, based on previous test data, experience, or manufacturer specs. sx = standard deviation of means sx = standard deviation of samples umber of Measurements Required: Precision interval is two-sided about the mean. / / x tv, 95sx/ x' < x+ tv, 95sx/ One-sided precision value: / d = CI/ = tv, 95sx/ Then required measurements: ( tv,95sx/ d) P= (95%) -approximation due to S x 5

16 umber of Measurements Required: The approximation serves as a reminder that this expression is based in an assumed value for S x. The accuracy of equation 4.43 will depend on how well the assumed value for S x approximates σ. The obvious deficiency in this method is that an estimate for the sample variance is needed. One way around this is to make a preliminary small number of measurements,, to obtain an estimate of the sample variance, S, to be expected. umber of Measurements Required: The total number of measurements, T, will be estimated by: t T (. 95 d S ) (95%) This establishes that T additional measurements will be required. 6

Probability & Statistics

Probability & Statistics MECE 330 MECE 330 Measurements & Instrumentation Probability & tatistics Dr. Isaac Choutapalli Department of Mechanical Engineering University of Teas Pan American MECE 330 Introduction uppose we have

More information

Chapter 3. Second Order Measurement System Behaviour

Chapter 3. Second Order Measurement System Behaviour Chapter 3 Second Order Measurement System Behaviour 1 GENERAL MEASUREMENT SYSTEM MODEL INITIAL CONDITIONS y(0) SIGNAL INPUT F(t) MEASUREMENT SYSTEM SYSTEM OUTPUT y(t) Understanding the theoretical response

More information

Coverage: through class #7 material (updated June 14, 2017) Terminology/miscellaneous. for a linear instrument:

Coverage: through class #7 material (updated June 14, 2017) Terminology/miscellaneous. for a linear instrument: Boston University ME 310: Summer 017 Course otes to-date Prof. C. Farny DISCLAIMER: This document is meant to serve as a reference and an overview, and not an exclusive study guide for the course. Coverage:

More information

Measurement And Uncertainty

Measurement And Uncertainty Measurement And Uncertainty Based on Guidelines for Evaluating and Expressing the Uncertainty of NIST Measurement Results, NIST Technical Note 1297, 1994 Edition PHYS 407 1 Measurement approximates or

More information

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions ASTR 511/O Connell Lec 6 1 STATISTICS OF OBSERVATIONS & SAMPLING THEORY References: Bevington Data Reduction & Error Analysis for the Physical Sciences LLM: Appendix B Warning: the introductory literature

More information

Applied Statistics I

Applied Statistics I Applied Statistics I Liang Zhang Department of Mathematics, University of Utah July 17, 2008 Liang Zhang (UofU) Applied Statistics I July 17, 2008 1 / 23 Large-Sample Confidence Intervals Liang Zhang (UofU)

More information

Algebra of Random Variables: Optimal Average and Optimal Scaling Minimising

Algebra of Random Variables: Optimal Average and Optimal Scaling Minimising Review: Optimal Average/Scaling is equivalent to Minimise χ Two 1-parameter models: Estimating < > : Scaling a pattern: Two equivalent methods: Algebra of Random Variables: Optimal Average and Optimal

More information

ME EN 363 Elementary Instrumentation

ME EN 363 Elementary Instrumentation ME E 6 Elementary Instrumentation Curve Fitting Least squares regression analysis Standard error of fit Correlation coefficient Error in static sensitivity Regression According to various Internet resources,

More information

Introduction to Error Analysis

Introduction to Error Analysis Introduction to Error Analysis Part 1: the Basics Andrei Gritsan based on lectures by Petar Maksimović February 1, 2010 Overview Definitions Reporting results and rounding Accuracy vs precision systematic

More information

Some Statistics. V. Lindberg. May 16, 2007

Some Statistics. V. Lindberg. May 16, 2007 Some Statistics V. Lindberg May 16, 2007 1 Go here for full details An excellent reference written by physicists with sample programs available is Data Reduction and Error Analysis for the Physical Sciences,

More information

Analysis of Experimental Designs

Analysis of Experimental Designs Analysis of Experimental Designs p. 1/? Analysis of Experimental Designs Gilles Lamothe Mathematics and Statistics University of Ottawa Analysis of Experimental Designs p. 2/? Review of Probability A short

More information

The sample mean and sample variance are given by: x sample standard deviation Excel: STDEV(values)

The sample mean and sample variance are given by: x sample standard deviation Excel: STDEV(values) Unless we have made a very large number of measurements, we don't have an accurate estimate of the mean or standard deviation of a data set. If we assume the values are normally distributed, we can estimate

More information

IV. The Normal Distribution

IV. The Normal Distribution IV. The Normal Distribution The normal distribution (a.k.a., the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

Algebra of Random Variables: Optimal Average and Optimal Scaling Minimising

Algebra of Random Variables: Optimal Average and Optimal Scaling Minimising Review: Optimal Average/Scaling is equivalent to Minimise χ Two 1-parameter models: Estimating < > : Scaling a pattern: Two equivalent methods: Algebra of Random Variables: Optimal Average and Optimal

More information

Introduction to Statistics and Error Analysis II

Introduction to Statistics and Error Analysis II Introduction to Statistics and Error Analysis II Physics116C, 4/14/06 D. Pellett References: Data Reduction and Error Analysis for the Physical Sciences by Bevington and Robinson Particle Data Group notes

More information

Chapter 11 Sampling Distribution. Stat 115

Chapter 11 Sampling Distribution. Stat 115 Chapter 11 Sampling Distribution Stat 115 1 Definition 11.1 : Random Sample (finite population) Suppose we select n distinct elements from a population consisting of N elements, using a particular probability

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

IV. The Normal Distribution

IV. The Normal Distribution IV. The Normal Distribution The normal distribution (a.k.a., a the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

Statistics: Error (Chpt. 5)

Statistics: Error (Chpt. 5) Statistics: Error (Chpt. 5) Always some amount of error in every analysis (How much can you tolerate?) We examine error in our measurements to know reliably that a given amount of analyte is in the sample

More information

Tables Table A Table B Table C Table D Table E 675

Tables Table A Table B Table C Table D Table E 675 BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values

More information

An Introduction to Error Analysis

An Introduction to Error Analysis An Introduction to Error Analysis Introduction The following notes (courtesy of Prof. Ditchfield) provide an introduction to quantitative error analysis: the study and evaluation of uncertainty in measurement.

More information

-However, this definition can be expanded to include: biology (biometrics), environmental science (environmetrics), economics (econometrics).

-However, this definition can be expanded to include: biology (biometrics), environmental science (environmetrics), economics (econometrics). Chemometrics Application of mathematical, statistical, graphical or symbolic methods to maximize chemical information. -However, this definition can be expanded to include: biology (biometrics), environmental

More information

Introduction to statistics

Introduction to statistics Introduction to statistics Literature Raj Jain: The Art of Computer Systems Performance Analysis, John Wiley Schickinger, Steger: Diskrete Strukturen Band 2, Springer David Lilja: Measuring Computer Performance:

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Confidence Intervals for the Sample Mean

Confidence Intervals for the Sample Mean Confidence Intervals for the Sample Mean As we saw before, parameter estimators are themselves random variables. If we are going to make decisions based on these uncertain estimators, we would benefit

More information

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11.

1.0 Continuous Distributions. 5.0 Shapes of Distributions. 6.0 The Normal Curve. 7.0 Discrete Distributions. 8.0 Tolerances. 11. Chapter 4 Statistics 45 CHAPTER 4 BASIC QUALITY CONCEPTS 1.0 Continuous Distributions.0 Measures of Central Tendency 3.0 Measures of Spread or Dispersion 4.0 Histograms and Frequency Distributions 5.0

More information

Bias Variance Trade-off

Bias Variance Trade-off Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]

More information

ik () uk () Today s menu Last lecture Some definitions Repeatability of sensing elements

ik () uk () Today s menu Last lecture Some definitions Repeatability of sensing elements Last lecture Overview of the elements of measurement systems. Sensing elements. Signal conditioning elements. Signal processing elements. Data presentation elements. Static characteristics of measurement

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

STATISTICS ANCILLARY SYLLABUS. (W.E.F. the session ) Semester Paper Code Marks Credits Topic

STATISTICS ANCILLARY SYLLABUS. (W.E.F. the session ) Semester Paper Code Marks Credits Topic STATISTICS ANCILLARY SYLLABUS (W.E.F. the session 2014-15) Semester Paper Code Marks Credits Topic 1 ST21012T 70 4 Descriptive Statistics 1 & Probability Theory 1 ST21012P 30 1 Practical- Using Minitab

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Chapter 8 - Statistical intervals for a single sample

Chapter 8 - Statistical intervals for a single sample Chapter 8 - Statistical intervals for a single sample 8-1 Introduction In statistics, no quantity estimated from data is known for certain. All estimated quantities have probability distributions of their

More information

II. The Normal Distribution

II. The Normal Distribution II. The Normal Distribution The normal distribution (a.k.a., a the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact

More information

APPENDICES APPENDIX A. STATISTICAL TABLES AND CHARTS 651 APPENDIX B. BIBLIOGRAPHY 677 APPENDIX C. ANSWERS TO SELECTED EXERCISES 679

APPENDICES APPENDIX A. STATISTICAL TABLES AND CHARTS 651 APPENDIX B. BIBLIOGRAPHY 677 APPENDIX C. ANSWERS TO SELECTED EXERCISES 679 APPENDICES APPENDIX A. STATISTICAL TABLES AND CHARTS 1 Table I Summary of Common Probability Distributions 2 Table II Cumulative Standard Normal Distribution Table III Percentage Points, 2 of the Chi-Squared

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Statistical Methods for Astronomy

Statistical Methods for Astronomy Statistical Methods for Astronomy Probability (Lecture 1) Statistics (Lecture 2) Why do we need statistics? Useful Statistics Definitions Error Analysis Probability distributions Error Propagation Binomial

More information

STAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)

STAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow) STAT40 Midterm Exam University of Illinois Urbana-Champaign October 19 (Friday), 018 3:00 4:15p SOLUTIONS (Yellow) Question 1 (15 points) (10 points) 3 (50 points) extra ( points) Total (77 points) Points

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Chemometrics. Matti Hotokka Physical chemistry Åbo Akademi University

Chemometrics. Matti Hotokka Physical chemistry Åbo Akademi University Chemometrics Matti Hotokka Physical chemistry Åbo Akademi University Hypothesis testing Inference method Confidence levels Descriptive statistics Hypotesis testing Predictive statistics Hypothesis testing

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

Advanced Engineering Statistics - Section 5 - Jay Liu Dept. Chemical Engineering PKNU

Advanced Engineering Statistics - Section 5 - Jay Liu Dept. Chemical Engineering PKNU Advanced Engineering Statistics - Section 5 - Jay Liu Dept. Chemical Engineering PKNU Least squares regression What we will cover Box, G.E.P., Use and abuse of regression, Technometrics, 8 (4), 625-629,

More information

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population Lecture 5 1 Lecture 3 The Population Variance The population variance, denoted σ 2, is the sum of the squared deviations about the population mean divided by the number of observations in the population,

More information

Objective Experiments Glossary of Statistical Terms

Objective Experiments Glossary of Statistical Terms Objective Experiments Glossary of Statistical Terms This glossary is intended to provide friendly definitions for terms used commonly in engineering and science. It is not intended to be absolutely precise.

More information

Ch. 7. One sample hypothesis tests for µ and σ

Ch. 7. One sample hypothesis tests for µ and σ Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math

More information

Statistics notes. A clear statistical framework formulates the logic of what we are doing and why. It allows us to make precise statements.

Statistics notes. A clear statistical framework formulates the logic of what we are doing and why. It allows us to make precise statements. Statistics notes Introductory comments These notes provide a summary or cheat sheet covering some basic statistical recipes and methods. These will be discussed in more detail in the lectures! What is

More information

Inference in Regression Analysis

Inference in Regression Analysis Inference in Regression Analysis Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 4, Slide 1 Today: Normal Error Regression Model Y i = β 0 + β 1 X i + ǫ i Y i value

More information

Error Analysis How Do We Deal With Uncertainty In Science.

Error Analysis How Do We Deal With Uncertainty In Science. How Do We Deal With Uncertainty In Science. 1 Error Analysis - the study and evaluation of uncertainty in measurement. 2 The word error does not mean mistake or blunder in science. 3 Experience shows no

More information

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr.

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr. Topic 2: Probability & Distributions ECO220Y5Y: Quantitative Methods in Economics Dr. Nick Zammit University of Toronto Department of Economics Room KN3272 n.zammit utoronto.ca November 21, 2017 Dr. Nick

More information

Practical Statistics for the Analytical Scientist Table of Contents

Practical Statistics for the Analytical Scientist Table of Contents Practical Statistics for the Analytical Scientist Table of Contents Chapter 1 Introduction - Choosing the Correct Statistics 1.1 Introduction 1.2 Choosing the Right Statistical Procedures 1.2.1 Planning

More information

Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error

Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error Statistical Analysis of Engineering Data The Bare Bones Edition (I) Precision, Bias, Accuracy, Measures of Precision, Propagation of Error PRIOR TO DATA ACQUISITION ONE SHOULD CONSIDER: 1. The accuracy

More information

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis Lecture 3 1 Probability (90 min.) Definition, Bayes theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests (90 min.) general concepts, test statistics,

More information

Introduction to Uncertainty. Asma Khalid and Muhammad Sabieh Anwar

Introduction to Uncertainty. Asma Khalid and Muhammad Sabieh Anwar Introduction to Uncertainty Asma Khalid and Muhammad Sabieh Anwar 1 Measurements and uncertainties Uncertainty Types of uncertainty Standard uncertainty Confidence Intervals Expanded Uncertainty Examples

More information

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics)

Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics) Brandon C. Kelly (Harvard Smithsonian Center for Astrophysics) Probability quantifies randomness and uncertainty How do I estimate the normalization and logarithmic slope of a X ray continuum, assuming

More information

n i n T Note: You can use the fact that t(.975; 10) = 2.228, t(.95; 10) = 1.813, t(.975; 12) = 2.179, t(.95; 12) =

n i n T Note: You can use the fact that t(.975; 10) = 2.228, t(.95; 10) = 1.813, t(.975; 12) = 2.179, t(.95; 12) = MAT 3378 3X Midterm Examination (Solutions) 1. An experiment with a completely randomized design was run to determine whether four specific firing temperatures affect the density of a certain type of brick.

More information

Application Note AN37. Noise Histogram Analysis. by John Lis

Application Note AN37. Noise Histogram Analysis. by John Lis AN37 Application Note Noise Histogram Analysis by John Lis NOISELESS, IDEAL CONVERTER OFFSET ERROR σ RMS NOISE HISTOGRAM OF SAMPLES PROBABILITY DISTRIBUTION FUNCTION X PEAK-TO-PEAK NOISE Crystal Semiconductor

More information

Data Fitting - Lecture 6

Data Fitting - Lecture 6 1 Central limit theorem Data Fitting - Lecture 6 Recall that for a sequence of independent, random variables, X i, one can define a mean, µ i, and a variance σi. then the sum of the means also forms a

More information

Chapter 2: Statistical Methods. 4. Total Measurement System and Errors. 2. Characterizing statistical distribution. 3. Interpretation of Results

Chapter 2: Statistical Methods. 4. Total Measurement System and Errors. 2. Characterizing statistical distribution. 3. Interpretation of Results 36 Chapter : Statistical Methods 1. Introduction. Characterizing statistical distribution 3. Interpretation of Results 4. Total Measurement System and Errors 5. Regression Analysis 37 1.Introduction The

More information

Relate Attributes and Counts

Relate Attributes and Counts Relate Attributes and Counts This procedure is designed to summarize data that classifies observations according to two categorical factors. The data may consist of either: 1. Two Attribute variables.

More information

Uncertainty due to Finite Resolution Measurements

Uncertainty due to Finite Resolution Measurements Uncertainty due to Finite Resolution Measurements S.D. Phillips, B. Tolman, T.W. Estler National Institute of Standards and Technology Gaithersburg, MD 899 Steven.Phillips@NIST.gov Abstract We investigate

More information

Statistics for Data Analysis. Niklaus Berger. PSI Practical Course Physics Institute, University of Heidelberg

Statistics for Data Analysis. Niklaus Berger. PSI Practical Course Physics Institute, University of Heidelberg Statistics for Data Analysis PSI Practical Course 2014 Niklaus Berger Physics Institute, University of Heidelberg Overview You are going to perform a data analysis: Compare measured distributions to theoretical

More information

Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv: )

Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv: ) Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv:1809.05778) Workshop on Advanced Statistics for Physics Discovery aspd.stat.unipd.it Department of Statistical Sciences, University of

More information

Linear Regression Models

Linear Regression Models Linear Regression Models Model Description and Model Parameters Modelling is a central theme in these notes. The idea is to develop and continuously improve a library of predictive models for hazards,

More information

Statistical Methods in Particle Physics

Statistical Methods in Particle Physics Statistical Methods in Particle Physics Lecture 10 December 17, 01 Silvia Masciocchi, GSI Darmstadt Winter Semester 01 / 13 Method of least squares The method of least squares is a standard approach to

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

REVIEW: Midterm Exam. Spring 2012

REVIEW: Midterm Exam. Spring 2012 REVIEW: Midterm Exam Spring 2012 Introduction Important Definitions: - Data - Statistics - A Population - A census - A sample Types of Data Parameter (Describing a characteristic of the Population) Statistic

More information

Other Continuous Probability Distributions

Other Continuous Probability Distributions CHAPTER Probability, Statistics, and Reliability for Engineers and Scientists Second Edition PROBABILITY DISTRIBUTION FOR CONTINUOUS RANDOM VARIABLES A. J. Clar School of Engineering Department of Civil

More information

Lecture 3. Probability - Part 2. Luigi Freda. ALCOR Lab DIAG University of Rome La Sapienza. October 19, 2016

Lecture 3. Probability - Part 2. Luigi Freda. ALCOR Lab DIAG University of Rome La Sapienza. October 19, 2016 Lecture 3 Probability - Part 2 Luigi Freda ALCOR Lab DIAG University of Rome La Sapienza October 19, 2016 Luigi Freda ( La Sapienza University) Lecture 3 October 19, 2016 1 / 46 Outline 1 Common Continuous

More information

Coverage: through entire course material (updated July 18, 2017) Terminology/miscellaneous. for a linear instrument:

Coverage: through entire course material (updated July 18, 2017) Terminology/miscellaneous. for a linear instrument: Boston University ME 310: Summer 017 Course Notes Prof. C. Farny DISCLAIMER: This document is meant to serve as a reference and an overview, and not an exclusive study guide for the course. Coverage: through

More information

STA1000F Summary. Mitch Myburgh MYBMIT001 May 28, Work Unit 1: Introducing Probability

STA1000F Summary. Mitch Myburgh MYBMIT001 May 28, Work Unit 1: Introducing Probability STA1000F Summary Mitch Myburgh MYBMIT001 May 28, 2015 1 Module 1: Probability 1.1 Work Unit 1: Introducing Probability 1.1.1 Definitions 1. Random Experiment: A procedure whose outcome (result) in a particular

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

Chapter 8 of Devore , H 1 :

Chapter 8 of Devore , H 1 : Chapter 8 of Devore TESTING A STATISTICAL HYPOTHESIS Maghsoodloo A statistical hypothesis is an assumption about the frequency function(s) (i.e., PDF or pdf) of one or more random variables. Stated in

More information

Introduction to Statistics and Error Analysis

Introduction to Statistics and Error Analysis Introduction to Statistics and Error Analysis Physics116C, 4/3/06 D. Pellett References: Data Reduction and Error Analysis for the Physical Sciences by Bevington and Robinson Particle Data Group notes

More information

Probability & Statistics: Infinite Statistics. Robert Leishman Mark Colton ME 363 Spring 2011

Probability & Statistics: Infinite Statistics. Robert Leishman Mark Colton ME 363 Spring 2011 Probability & Statistics: Infinite Statistics Robert Leishman Mark Colton ME 363 Spring 0 Large Data Sets What happens to a histogram as N becomes large (N )? Number of bins becomes large (K ) Width of

More information

SIO 221B, Rudnick adapted from Davis 1. 1 x lim. N x 2 n = 1 N. { x} 1 N. N x = 1 N. N x = 1 ( N N x ) x = 0 (3) = 1 x N 2

SIO 221B, Rudnick adapted from Davis 1. 1 x lim. N x 2 n = 1 N. { x} 1 N. N x = 1 N. N x = 1 ( N N x ) x = 0 (3) = 1 x N 2 SIO B, Rudnick adapted from Davis VII. Sampling errors We do not have access to the true statistics, so we must compute sample statistics. By this we mean that the number of realizations we average over

More information

Theoretical Probability Models

Theoretical Probability Models CHAPTER Duxbury Thomson Learning Maing Hard Decision Third Edition Theoretical Probability Models A. J. Clar School of Engineering Department of Civil and Environmental Engineering 9 FALL 003 By Dr. Ibrahim.

More information

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the

More information

Simple Linear Regression for the Climate Data

Simple Linear Regression for the Climate Data Prediction Prediction Interval Temperature 0.2 0.0 0.2 0.4 0.6 0.8 320 340 360 380 CO 2 Simple Linear Regression for the Climate Data What do we do with the data? y i = Temperature of i th Year x i =CO

More information

STAT 111 Recitation 7

STAT 111 Recitation 7 STAT 111 Recitation 7 Xin Lu Tan xtan@wharton.upenn.edu October 25, 2013 1 / 13 Miscellaneous Please turn in homework 6. Please pick up homework 7 and the graded homework 5. Please check your grade and

More information

Robustness and Distribution Assumptions

Robustness and Distribution Assumptions Chapter 1 Robustness and Distribution Assumptions 1.1 Introduction In statistics, one often works with model assumptions, i.e., one assumes that data follow a certain model. Then one makes use of methodology

More information

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution

Lecture 12: Small Sample Intervals Based on a Normal Population Distribution Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)

More information

1; (f) H 0 : = 55 db, H 1 : < 55.

1; (f) H 0 : = 55 db, H 1 : < 55. Reference: Chapter 8 of J. L. Devore s 8 th Edition By S. Maghsoodloo TESTING a STATISTICAL HYPOTHESIS A statistical hypothesis is an assumption about the frequency function(s) (i.e., pmf or pdf) of one

More information

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College An example ANOVA situation Example (Treating Blisters) Subjects: 25 patients with blisters Treatments: Treatment A, Treatment

More information

Simple Linear Regression Analysis

Simple Linear Regression Analysis LINEAR REGRESSION ANALYSIS MODULE II Lecture - 6 Simple Linear Regression Analysis Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Prediction of values of study

More information

AP Statistics Cumulative AP Exam Study Guide

AP Statistics Cumulative AP Exam Study Guide AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics

More information

MATH c UNIVERSITY OF LEEDS Examination for the Module MATH1725 (May-June 2009) INTRODUCTION TO STATISTICS. Time allowed: 2 hours

MATH c UNIVERSITY OF LEEDS Examination for the Module MATH1725 (May-June 2009) INTRODUCTION TO STATISTICS. Time allowed: 2 hours 01 This question paper consists of 11 printed pages, each of which is identified by the reference. Only approved basic scientific calculators may be used. Statistical tables are provided at the end of

More information

Statistical Hypothesis Testing

Statistical Hypothesis Testing Statistical Hypothesis Testing Dr. Phillip YAM 2012/2013 Spring Semester Reference: Chapter 7 of Tests of Statistical Hypotheses by Hogg and Tanis. Section 7.1 Tests about Proportions A statistical hypothesis

More information

Practical Statistics

Practical Statistics Practical Statistics Lecture 1 (Nov. 9): - Correlation - Hypothesis Testing Lecture 2 (Nov. 16): - Error Estimation - Bayesian Analysis - Rejecting Outliers Lecture 3 (Nov. 18) - Monte Carlo Modeling -

More information

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning SMA 6304 / MIT 2.853 / MIT 2.854 Manufacturing Systems Lecture 10: Data and Regression Analysis Lecturer: Prof. Duane S. Boning 1 Agenda 1. Comparison of Treatments (One Variable) Analysis of Variance

More information

arxiv:hep-ex/ v1 2 Jun 2000

arxiv:hep-ex/ v1 2 Jun 2000 MPI H - V7-000 May 3, 000 Averaging Measurements with Hidden Correlations and Asymmetric Errors Michael Schmelling / MPI for Nuclear Physics Postfach 03980, D-6909 Heidelberg arxiv:hep-ex/0006004v Jun

More information

Statistics. Statistics

Statistics. Statistics The main aims of statistics 1 1 Choosing a model 2 Estimating its parameter(s) 1 point estimates 2 interval estimates 3 Testing hypotheses Distributions used in statistics: χ 2 n-distribution 2 Let X 1,

More information

Data Analysis II. CU- Boulder CHEM-4181 Instrumental Analysis Laboratory. Prof. Jose-Luis Jimenez Spring 2007

Data Analysis II. CU- Boulder CHEM-4181 Instrumental Analysis Laboratory. Prof. Jose-Luis Jimenez Spring 2007 Data Analysis II CU- Boulder CHEM-48 Instrumental Analysis Laboratory Prof. Jose-Luis Jimenez Spring 007 Lecture will be posted on course web page based on lab manual, Skoog, web links Summary of Last

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Exploring Data: Distributions Look for overall pattern (shape, center, spread) and deviations (outliers). Mean (use a calculator): x = x 1 + x

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Generalized Additive Models

Generalized Additive Models Generalized Additive Models The Model The GLM is: g( µ) = ß 0 + ß 1 x 1 + ß 2 x 2 +... + ß k x k The generalization to the GAM is: g(µ) = ß 0 + f 1 (x 1 ) + f 2 (x 2 ) +... + f k (x k ) where the functions

More information

Homework for 1/13 Due 1/22

Homework for 1/13 Due 1/22 Name: ID: Homework for 1/13 Due 1/22 1. [ 5-23] An irregularly shaped object of unknown area A is located in the unit square 0 x 1, 0 y 1. Consider a random point distributed uniformly over the square;

More information

7.1 Basic Properties of Confidence Intervals

7.1 Basic Properties of Confidence Intervals 7.1 Basic Properties of Confidence Intervals What s Missing in a Point Just a single estimate What we need: how reliable it is Estimate? No idea how reliable this estimate is some measure of the variability

More information

p(z)

p(z) Chapter Statistics. Introduction This lecture is a quick review of basic statistical concepts; probabilities, mean, variance, covariance, correlation, linear regression, probability density functions and

More information