Basic Statistical Analysis!

Similar documents
Statistical Analysis of Engineering Data The Bare Bones Edition. Precision, Bias, Accuracy, Measures of Precision, Propagation of Error

Give students a few minutes to reflect on Exercise 1. Then ask students to share their initial reactions and thoughts in answering the questions.

Chapter 1 Review Problems (Done in Class)

POWER UNDERSTANDING MEASUREMENT UNCERTAINTY IN DP FLOW DEVICES

Lab 1: Measurement, Uncertainty, and Uncertainty Propagation

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Error Analysis in Experimental Physical Science Mini-Version

Precision Correcting for Random Error

16.400/453J Human Factors Engineering. Design of Experiments II

Two-sample inference: Continuous data

This term refers to the physical quantity that is the result of the measurement activity.

appstats27.notebook April 06, 2017

Chapter 27 Summary Inferences for Regression

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

L03. PROBABILITY REVIEW II COVARIANCE PROJECTION. NA568 Mobile Robotics: Methods & Algorithms

Appendix C: Accuracy, Precision, and Uncertainty

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge).

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Section 4.1 Relative Extrema 3 Lectures. Dr. Abdulla Eid. College of Science. MATHS 101: Calculus I

Ch. 7. One sample hypothesis tests for µ and σ

1.9 Algebraic Expressions

Physics 2020 Laboratory Manual

Lecture 06 Lecture 4: Part II - Uncertainty principle

Supporting Australian Mathematics Project. A guide for teachers Years 11 and 12. Probability and statistics: Module 25. Inference for means

Descriptive Statistics: cal. Is it reasonable to use a t test to test hypotheses about the mean? Hypotheses: Test Statistic: P value:

Introduction to Uncertainty and Treatment of Data

Fluid Mechanics Prof. S.K. Som Department of Mechanical Engineering Indian Institute of Technology, Kharagpur

33. SOLVING LINEAR INEQUALITIES IN ONE VARIABLE

Warm-up Using the given data Create a scatterplot Find the regression line

Chem 321 Lecture 5 - Experimental Errors and Statistics 9/10/13

If you like us, please share us on social media. The latest UCD Hyperlibrary newsletter is now complete, check it out.

Sampling Distribution Models. Chapter 17

Fitting a Straight Line to Data

Lecture 8 Hypothesis Testing

Lesson 10: True and False Equations

Week 11 Sample Means, CLT, Correlation

Inferences for Regression

Measurements and Data Analysis

(Refer Slide Time 1:25)

Brief Review of Probability

Experiment 2. Reaction Time. Make a series of measurements of your reaction time. Use statistics to analyze your reaction time.

Solving with Absolute Value

4.1 Hypothesis Testing

Chem 115 POGIL Worksheet - Week 1 Units, Measurement Uncertainty, and Significant Figures

Lecture 19: Introduction to Kinetics First a CH 302 Kinetics Study Guide (Memorize these first three pages, they are all the background you need)

LECTURE 1 THE CONTENTS OF THIS LECTURE ARE AS FOLLOWS:

Basic Probability Reference Sheet

Uncertainty and Graphical Analysis

Go back to the main index page

2. As we shall see, we choose to write in terms of σ x because ( X ) 2 = σ 2 x.

Introduction to Physics Physics 114 Eyres

Natural Language Processing Prof. Pawan Goyal Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur

Statistics: Error (Chpt. 5)

Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples.

Ch. 1, Physics & Measurement

Moment of Inertia: Rotational Energy

The following are generally referred to as the laws or rules of exponents. x a x b = x a+b (5.1) 1 x b a (5.2) (x a ) b = x ab (5.

Statistics. Lent Term 2015 Prof. Mark Thomson. 2: The Gaussian Limit

Network Simulation Chapter 6: Output Data Analysis

Lab #12: Exam 3 Review Key

Final Exam - Solutions

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable

Civil Engineering Computation. Initial Meeting and Class Setup EXCEL Review Problems

Error Analysis. To become familiar with some principles of error analysis for use in later laboratories.

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Mathematics-I Prof. S.K. Ray Department of Mathematics and Statistics Indian Institute of Technology, Kanpur. Lecture 1 Real Numbers

AP Physics 1 Summer Assignment Packet 2

Gaussian Quiz. Preamble to The Humble Gaussian Distribution. David MacKay 1

Design and Optimization of Energy Systems Prof. C. Balaji Department of Mechanical Engineering Indian Institute of Technology, Madras

161 Sp18 T1 grades (out of 40, max 100)

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

Statistical Intervals (One sample) (Chs )

I used college textbooks because they were the only resource available to evaluate measurement uncertainty calculations.

Review of Functions. Functions. Philippe B. Laval. Current Semester KSU. Philippe B. Laval (KSU) Functions Current Semester 1 / 12

Line Broadening. φ(ν) = Γ/4π 2 (ν ν 0 ) 2 + (Γ/4π) 2, (3) where now Γ = γ +2ν col includes contributions from both natural broadening and collisions.

Two-sample inference: Continuous data

Uncertainty in Measurement

26, 24, 26, 28, 23, 23, 25, 24, 26, 25

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

(Refer Slide Time: )

Take the measurement of a person's height as an example. Assuming that her height has been determined to be 5' 8", how accurate is our result?

Convergence of Sequences

Statistical Methods in Particle Physics

Data Analysis and Statistical Methods Statistics 651

Finite Mathematics : A Business Approach

Lecturer three. General Concept in Q. M. Imagine a room containing 14 people, whose ages are as follows:

ik () uk () Today s menu Last lecture Some definitions Repeatability of sensing elements

the pitot static measurement equal to a constant C which is to take into account the effect of viscosity and so on.

Lecture 3: Statistical sampling uncertainty

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Derivatives: Gradient of a line, gradient of a curve

Modern Algebra Prof. Manindra Agrawal Department of Computer Science and Engineering Indian Institute of Technology, Kanpur

Lecture 18: Simple Linear Regression

Uncertainty, Error, and Precision in Quantitative Measurements an Introduction 4.4 cm Experimental error

Treatment of Error in Experimental Measurements

211 Real Analysis. f (x) = x2 1. x 1. x 2 1

AN INTRODUCTION TO TOPIC MODELS

Physics 1140 Fall 2013 Introduction to Experimental Physics

Business Statistics. Lecture 5: Confidence Intervals

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

Transcription:

Basic Statistical Analysis! Prof. Dan Burkey! CHEG 4137W!! UConn CMBE!! Fall 011! 1

Why do we care? We can only make a finite number of measurements of a system Statistical analysis tells us something about the quality of the data we obtain. Error analysis and propagation of error let us talk about confidence in our numbers, both measured and calculated.

Terminology Statistical analysis has its own language A distinct branch of mathematics There are entire, extremely detailed courses in probability and statistics. This lecture is designed to be an introduction to statistical analysis. Think of it as the wading pool - get your feet wet! 3

The what now? With the who? Accuracy vs. Precision Accuracy: How close a series of measured values are to the actual or true value. Precision: (aka reproducibility or repeatability) The degree to which repeated measurements made under identical or unchanged conditions yield the same result. Measurements you make can be accurate but imprecise, precise but inaccurate, neither, or both 4

Accuracy and Precision Precise Not Accurate Value Accurate Not Precise Accurate and Precise True value Not Precise Not Accurate Number of observations 5

Let s Start Easy Average We all know this one! 1 N N " i=1 n i In Excel: =average(n 1,n,n 3, n N ) 6

Moving On Variance Sounds like what it is! " = 1 N N $ (x # µ) µ = 1 i N i=1 N " x i i=1 The more scatter, or variability, in your data, the greater the variance will be In Excel: varp(n 1,n,n 3, n N )! 7

Population vs. Sample Population: All possible measurements N in an experiment. Typically we can never sample the entire population - too time consuming! Sample: A subset, n, of the population that we use as representative of the entire population Sample size is important! If n is too small, it may not accurately represent the population. 8

Sample Variance Sample Variance Accounts for using a subset of the population n s = 1 #(x " x ) x = 1 n "1 i n i=1 n " x i i=1 (n-1) vs. N in the denominator - i.e. there is more variance in the sample than the entire population Why? In Excel: var(n 1,n,n 3, n N )! 9

Standard Deviation Standard Deviation May be familiar with this one as well s = s = n 1 #(x " x ) x = 1 n "1 i n i=1 n " x i i=1 Standard deviation is useful for predicting the confidence interval In Excel: stdev(n 1,n,n 3, n N )! 10

Standard Deviation Standard Deviation gives an idea of the probability that a measured value will be a certain distance from the mean or expected value - for normally distributed data! Number of Measurements The more measurements taken, the closer this histogram approaches the a true normal distribution. Measured Value 11

Standard Deviation The normal distribution is often used to describe, at least approximately, any variable that tends to cluster around a mean.! PDF = 1 % (x $ x ( ) exp ' $ "# & # * ) 1

Confidence Intervals A confidence interval is simply the probability that a measured value falls within a certain range of the mean or average value.! For normally distributed data, we expect ~95% of the measured values to be within standard deviations of the mean. For normally distributed data, we expect ~68% of the measured values to be within 1 standard deviation of the mean. 3σ = 99.6%" 4σ = 99.8% 13

Confidence Intervals Calculating a Confidence Interval: In Excel: confidence(alpha, stdev, n)" Alpha = confidence level. i.e. 0.05 = 95%" stdev = standard deviation for the data" n = sample size" Example: Everyone take out your wallet and driver s license." Tell me your height in inches.! 14

Confidence Intervals Student s T-test What happens when your sample size is small compared to the population?" The standard deviation as calculated from 10 data points will be different than that calculated from, say 1000 data points." TheT-test can be thought of as a correction which accounts for the uncertainty that arises from only sampling a small subset of the population! 15

Student T-test In Excel: =tinv(probability,dof)! Probability is the interval of interest:" i.e. 95% = 0.05" DOF = Degrees of Freedom" n - 1; That is, one less than the number of measurements." The number returned is the modifier for confidence interval." e.g. 95% with a large number of samples: ~σ" e.g. 95% with 10 samples:.6σ e.g. 95% with 3 samples: 4.3σ 16

Propagation of Error If we make a number of observations or measurements, and then perform a series of calculations based on those measurements, how do the individual errors/variations/uncertainties in my measured quantities effect (propagate through to) the calculated quantity?" Need a way to mathematically describe this phenomenon " 17

Example: Reynolds Number Say you want to calculate the Reynolds number for flow through a pipe: Re = "VD µ Note: You didn t measure V, you measured a mass of fluid collected over time. 18

Example: Reynolds Number V = 4m "t#d Re = 4m "tdµ Sources of uncertainty or error:" Mass: Say you collected 5 kg. The scale is accurate to 0.5 kg. Did you get exactly 5 kg? Was it more or less?" Time: Collection took 90 seconds. Your stopwatch is accurate to 1 second. Did it take exactly 90 s?" Diameter: The pipe was manufactured to a specified tolerance: 1/ Schedule 80: ID is 0.546 ± 0.00 " Viscosity: What is this a function of? π and 4 can be considered exact. 19

Propagation of Error In general, for a function Z = F(A,B):" "Z = $ #F ' & ) %#A( ("A) $ + & #F %#B ' ) ( ("B)... Can be extended to arbitrary number of variables." 0

Propagation of Error Example Z = F(A,B) = A + B" Thus:" "F "A =1 "F "B =1 "Z = ("A) + ("B) 1

Propagation of Error Some Common Forms:" Z = A + B : "Z = "A ( ) + ("B) Z = A # B : "Z = ("A) + ("B) Z = AB : "Z Z = $ "A' & ) % A ( $ + & "B % B ' ) ( Z = A B : "Z Z = $ "A' & ) % A ( Z = A n : "Z Z = n "A A $ + "B ' & ) % B ( Z = ln(a) : "Z = "A A Z = exp(a) : "Z Z = "A

Example: Reynolds Number Thus: Re = 4m "tdµ " Re Re = # "m& % ( $ m ' # + "t & % ( $ t ' # + % "D $ D & ( ' # + "µ & % ( $ µ ' Exercise: Prove this is the right form from the derivatives 3

Example: Reynolds Number " Re Re = # "m& % ( $ m ' # + "t & % ( $ t ' # + % "D $ D & ( ' # + "µ & % ( $ µ ' Sources of uncertainty or error:" Mass: Δm = 0.5 kg; m = 5 kg." Time: Δt = 1 sec; t = 90 sec." Diameter: ΔD = 0.00 in.; D = 0.546 in." Viscosity: F(T,P) = maybe assume constant, if conditions unchanged over the course of the measurements? " 8.68E-4 kg m -1 s -1 at 5 C" 4

Example: Reynolds Number " Re Re = # 0.5& % ( $ 5 ' # + 1 & % ( $ 90' # +.00 & % ( $ 0.546' # 0 + % $ 8.63x10-4 & ( ' And:" Re = 4m "tdµ = 4(5kg) (0.054 m) "(90s)(0.546in) 1in (8.68x10-4 kgm #1 s #1 ) Re = 4m "tdµ = 9380 = 9000 (sig figs) 5

Example: Reynolds Number " Re = Re # % $ 0.5 5 & ( ' # + 1 & % ( $ 90' # +.00 & % ( $ 0.546' # 0 + % $ 8.63x10-4 & ( ' " Re = 67 =1000 (sig figs) Thus, we would report Re = 9,000 ± 1000 6