Probability & Statistics: Infinite Statistics. Robert Leishman Mark Colton ME 363 Spring 2011

Similar documents
Probability & Statistics: Introduction. Robert Leishman Mark Colton ME 363 Spring 2011

Outline. Limits as x

CHAPTER 3 : QUADRARIC FUNCTIONS MODULE CONCEPT MAP Eercise 1 3. Recognizing the quadratic functions Graphs of quadratic functions 4 Eercis

2.0 Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table

Assessing and Presenting Experimental Data

Errors during the measurement process 3.1 Introduction

Signal Detection Basics - CFAR

MI-2 Prob. Set #4 DUE: Tuesday, Feb 15, 2011 Spring 11. 4) Write a matrix equation and give result:

Signals and Spectra - Review

Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table

Bayesian Technique for Reducing Uncertainty in Fatigue Failure Model

Continuous Random Variables

Chapter 2: Statistical Methods. 4. Total Measurement System and Errors. 2. Characterizing statistical distribution. 3. Interpretation of Results

JUST THE MATHS UNIT NUMBER 7.2. DETERMINANTS 2 (Consistency and third order determinants) A.J.Hobson

Lecture (20) DC Machine Examples Start of Synchronous Machines

Expectation Maximization Deconvolution Algorithm

Learning Targets: Standard Form: Quadratic Function. Parabola. Vertex Max/Min. x-coordinate of vertex Axis of symmetry. y-intercept.

Least Squares with Examples in Signal Processing 1. 2 Overdetermined equations. 1 Notation. The sum of squares of x is denoted by x 2 2, i.e.

Chapter 5 Torsion STRUCTURAL MECHANICS: CE203. Notes are based on Mechanics of Materials: by R. C. Hibbeler, 7th Edition, Pearson

Introduction to statistics

Probability Distribution

Estimation of Density Function Parameters with Censored Data from Product Life Tests

Central Limit Theorem and the Law of Large Numbers Class 6, Jeremy Orloff and Jonathan Bloom

MAT Mathematics in Today's World

CALCULUS APPLICATIONS OF DIFFERENTIATION LESSON PLAN. C3 Topic Overview

Section 5.4. Ken Ueda

Reporting Measurement and Uncertainty

MODULE 6 LECTURE NOTES 1 REVIEW OF PROBABILITY THEORY. Most water resources decision problems face the risk of uncertainty mainly because of the

EE16B - Spring 17 - Lecture 11B Notes 1

Scatterplots and Correlation

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N

Measures of Dispersion

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

2. A Basic Statistical Toolbox

There is a unique function s(x) that has the required properties. It turns out to also satisfy

Chapter 4. Probability and Statistics. Probability and Statistics

Descriptive Univariate Statistics and Bivariate Correlation

The Gradient. Consider the topography of the Earth s surface.

Chapter 4. Probability-The Study of Randomness

1. Find the domain of the following functions. Write your answer using interval notation. (9 pts.)

Instance-level l recognition. Cordelia Schmid INRIA

Module 3. Quantization and Coding. Version 2, ECE IIT, Kharagpur

Introduction to Inequalities

Chapter 2. Random Variable. Define single random variables in terms of their PDF and CDF, and calculate moments such as the mean and variance.

Chapter 5. Understanding and Comparing. Distributions

Bayesian Updating with Continuous Priors Class 13, Jeremy Orloff and Jonathan Bloom

November 13, 2018 MAT186 Week 8 Justin Ko

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals.

Machine Learning Lecture 3

Chapter 9. Non-Parametric Density Function Estimation

Reminders. Homework due tomorrow Quiz tomorrow

QUADRATIC GRAPHS ALGEBRA 2. Dr Adrian Jannetta MIMA CMath FRAS INU0114/514 (MATHS 1) Quadratic Graphs 1/ 16 Adrian Jannetta

Calculus with Algebra and Trigonometry II Lecture 21 Probability applications

Probability & Statistics

PDF Estimation via Characteristic Function and an Orthonormal Basis Set

MEASUREMENT OF STRESS PATTERN ON ROLL BEARING SUPPORTS BY THERMOELASTICITY

Vibrational Power Flow Considerations Arising From Multi-Dimensional Isolators. Abstract

Expression arrays, normalization, and error models

Introduction to Statistics and Error Analysis II

EMA 3702 Mechanics & Materials Science (Mechanics of Materials) Chapter 4 Pure Bending

Basic Statistical Tools

6.37 Determine the modulus of resilience for each of the following alloys:

UNIVERSITY OF BOLTON WESTERN INTERNATIONAL COLLEGE FZE BENG(HONS) MECHANICAL ENGINEERING SEMESTER ONE EXAMINATION 2016/2017 ENGINEERING PRINCIPLES 1

Section - 9 GRAPHS. (a) y f x (b) y f x. (c) y f x (d) y f x. (e) y f x (f) y f x k. (g) y f x k (h) y kf x. (i) y f kx. [a] y f x to y f x

are the objects described by a set of data. They may be people, animals or things.

Baruch College MTH 1030 Sample Final B Form 0809 PAGE 1

Introduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones

ME 201 Engineering Mechanics: Statics

Numerical Methods Lecture 7 - Statistics, Probability and Reliability

1 x 2 and 1 y 2. 0 otherwise. c) Estimate by eye the value of x for which F(x) = x + y 0 x 1 and 0 y 1. 0 otherwise

State Estimation Introduction 2.0 Exact Pseudo-measurments

X: The Hall Effect in Metals

State Space and Hidden Markov Models

Histograms, Central Tendency, and Variability

Pre-Algebra 8 Notes Exponents and Scientific Notation

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability

Stress Analysis Lecture 3 ME 276 Spring Dr./ Ahmed Mohamed Nagib Elmekawy

E(x i ) = µ i. 2 d. + sin 1 d θ 2. for d < θ 2 0 for d θ 2

Beyond Newton s method Thomas P. Minka

Chapter 27 Summary Inferences for Regression

MICROPIPETTE CALIBRATIONS

EXPERIMENT: REACTION TIME

Figure 4-1. Naval Combat Survivability Testbed. 4.1 CLASSIFICATION OF SINGLE PORT POWER CONVERTERS

Chapter 9. Non-Parametric Density Function Estimation

ME111 Instructor: Peter Pinsky Class #21 November 13, 2000

MECHANICS OF MATERIALS

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Performance of fourth-grade students on an agility test

Lecture 17 - p-n Junction. October 11, Ideal p-n junction in equilibrium 2. Ideal p-n junction out of equilibrium

Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing

Problem d d d B C E D. 0.8d. Additional lecturebook examples 29 ME 323

Chapter 6. Nonlinear Equations. 6.1 The Problem of Nonlinear Root-finding. 6.2 Rate of Convergence

Bell Quiz 2-3. Determine the end behavior of the graph using limit notation. Find a function with the given zeros , 2. 5 pts possible.

Design against fluctuating load

One Solution Two Solutions Three Solutions Four Solutions. Since both equations equal y we can set them equal Combine like terms Factor Solve for x

Dimensionality Reduction and Principle Components

Chapter 2: The Random Variable

EIE 240 Electrical and Electronic Measurements Class 2: January 16, 2015 Werapon Chiracharit. Measurement

18.01 Single Variable Calculus Fall 2006

Sean Carey Tafe No Lab Report: Hounsfield Tension Test

Transcription:

Probability & Statistics: Infinite Statistics Robert Leishman Mark Colton ME 363 Spring 0

Large Data Sets What happens to a histogram as N becomes large (N )? Number of bins becomes large (K ) Width of bins becomes small ( 0) Histogram becomes smoother and approaches a continuous function

Eample: ASTM-A4 Steel n (%) 0.5 0. 0.5 0. 0.05 0 450 460 470 480 490 500 50 S (Mpa) N = 00 K = S ave = 479.8 MPa n (%) 3.5.5.5 0.5 4 0-3 3 0 40 440 460 480 500 50 540 S (Mpa) N = 0,000,000 K =,80 S ave = 480.00 MPa

Probability Density Function For an infinite data set, the frequency distribution is smooth and continuous This function is called the probability density function (p.d.f.) The p.d.f. relates the value of a measured variable to its likelihood of occurring

Infinite vs. Finite Statistics In theory, we can have infinite data sets An infinite number of data points We can eamine the entire population, or all possible values In reality, data sets are finite A limited number of data points Based on a sample of the entire population We will use our limited sample sie to etract information about the entire population Eample: Etract information about ALL ASTM-A4 steel from 000 specimens

Standard p.d.f.s There are many shapes of p.d.f.s that can occur See Table 4. The shape of the p.d.f can be eperimentally determined by looking at the shape of the histogram, and finding which p.d.f best matches it

Standard p.d.f.s

Normal Distribution The normal or Gaussian distribution is one of the most common It can be used to represent variables that are random about a mean Electrical noise Variation in strength of steel Precision error Bell Curve

Normal Distribution The p.d.f. of the normal distribution is given by: p ( ) σ ( ) = σ e π is the value of a particular measurement is the true mean of the entire population Describes the tendency of the population Determines the center point of the normal curve Also called μ (when it is sampled, not true) σ is the standard deviation of the entire population Describes the population s spread Determines the width of the normal curve

Normal Distribution If we have the p.d.f. of some data, then we can make predictions about the probability P of future measurements Specifically, we can predict the probability of a data point falling within a certain interval This probability is given by the area under the p.d.f. over the appropriate interval + = + d p P ) ( ) ( + = + d e P σ π σ ) ( ) ( - +

Normal Distribution Make a change of variables: The integral then becomes Since the normal distribution is symmetric about : + = + d e P σ π σ ) ( ) ( σ σ = ) ( d e P π = 0 ) ( d e P π

Normal Distribution The bracketed epression is called the normal error function Solutions to this integral are tabulated in Table 4.3 (p. 8) The integral gives us a method for calculating the probability that lies between ± for a given distribution defined by and σ Best understood by doing some eamples σ σ = 0 ) ( d e P π

Eample What is the area under the normal distribution curve from = -.43 to =.43? What is the significance of this area?

Eample

From the table: Eample The area under the normal distribution curve from = 0 to =.43 is 0.436 This represents ½ of the integral between = -.43 to =.43 The total area is therefore (0.436) = 0.847 What does this mean? For data following a normal distribution, 84.7% of the population lies within the range -.43.43 But = + σ So this means that 84.7% of the population lies within ±.43 standard deviations of the mean This is true for any normally distributed data

Eample What range of a random variable will contain 90% of the population? Solution: Find such that 45% of the data lie between 0 and + and the other 45% lie between and 0 Use the table

Eample By interpolation, 0.45 =.645

Eample Again, = + σ So 90% of the population will fall within the range ( - 0.45 σ) < < ( + 0.45 σ) (.645σ) < < ( +.645σ) So 90% of the population will lie within ±.645 standard deviations of the mean

Comments The probability of a measurement occurring can be epressed in terms of the standard deviation of the population The probability of a measurement being within: ±σ of the mean is 68.7% ±σ of the mean is 95.45% ±3σ of the mean is 99.73% Data outside ±3σ are often considered outliers

Eample 3 You are assigned to measure the maimum noload speed of a new type of DC motor You apply a constant voltage to many of these motors and measure the maimum no-load speed for each motor You calculate that = 435.5 rpm and σ = 47.5 rpm Assuming that the variations in no-load speed are random (and normally distributed), what is the probability that a motor will have a no-load speed between 5000 and 500 rpm?

Eample 3 P(5000 500) = P(435.5 500) - P(435.5 5000) From upper limit (first term): = (500-435.5)/47.5 =.0696 P(.0696) = 0.4808 From lower limit (second term): = (5000-435.5)/47.5 =.608 P(.608) = 0.4474 P(5000 500) = 0.4808 0.4474 = 0.0334 = 3.34%