Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Testing. Confidence intervals on mean. CL = x ± t * CL1- = exp

Similar documents
Chapter 7 Comparison of two independent samples

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12,

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Two-Sample Inferential Statistics

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Ch 2: Simple Linear Regression

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews

CH.9 Tests of Hypotheses for a Single Sample

LECTURE 5. Introduction to Econometrics. Hypothesis testing

SMAM 314 Exam 3d Name

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Table 1: Fish Biomass data set on 26 streams

Psychology 282 Lecture #4 Outline Inferences in SLR

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

Mathematical statistics

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Tests about a population mean

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

Chapter 7: Hypothesis Testing

Performance Evaluation and Comparison

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

Single Sample Means. SOCY601 Alan Neustadtl

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

HYPOTHESIS TESTING. Hypothesis Testing

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Visual interpretation with normal approximation

Ch. 7. One sample hypothesis tests for µ and σ

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Control of Manufacturing Processes

Correlation and Regression (Excel 2007)

LECTURE 5 HYPOTHESIS TESTING

Applied Statistics for the Behavioral Sciences

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Quantitative Analysis and Empirical Methods

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Inference for Regression

Topic 19 Extensions on the Likelihood Ratio

df=degrees of freedom = n - 1

Data Analysis and Statistical Methods Statistics 651

Business Statistics. Lecture 10: Course Review

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

AMS 7 Correlation and Regression Lecture 8

Sociology 6Z03 Review II

Methods for Identifying Out-of-Trend Data in Analysis of Stability Measurements Part II: By-Time-Point and Multivariate Control Chart

Lectures 5 & 6: Hypothesis Testing

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Simple and Multiple Linear Regression

MAT 2377C FINAL EXAM PRACTICE

Lecture 3: Inference in SLR

SMAM 314 Exam 3 Name. F A. A null hypothesis that is rejected at α =.05 will always be rejected at α =.01.

Two Sample Hypothesis Tests

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 30. DATA 8 Summer Regression Inference

Mathematical Notation Math Introduction to Applied Statistics

Density Temp vs Ratio. temp

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

Econometrics. 4) Statistical inference

Two-sample inference: Continuous data

OHSU OGI Class ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Brainerd Basic Statistics Sample size?

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

Mathematical statistics

Lab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d

Ch 3: Multiple Linear Regression

Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

STA 6167 Exam 1 Spring 2016 PRINT Name

Chemometrics. Matti Hotokka Physical chemistry Åbo Akademi University

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

Medical statistics part I, autumn 2010: One sample test of hypothesis

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

1 Hypothesis testing for a single mean

Data Analysis and Statistical Methods Statistics 651

The t-statistic. Student s t Test

Introduction to Statistical Data Analysis III

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin

Lecture 5: ANOVA and Correlation

Lecture 15: Inference Based on Two Samples

Chapter 8 - Statistical intervals for a single sample

CBA4 is live in practice mode this week exam mode from Saturday!

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Hypothesis testing (cont d)

Introduction to Business Statistics QM 220 Chapter 12

Introduction to Statistics

One sample problem. sample mean: ȳ = . sample variance: s 2 = sample standard deviation: s = s 2. y i n. i=1. i=1 (y i ȳ) 2 n 1

Transcription:

Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Lecture Notes 1 Confidence intervals on mean Normal Distribution CL = x ± t * 1-α 1- α,n-1 s n Log-Normal Distribution CL = exp 1-α CL1- = exp α sln x x ln x ± t 1- α, n-1 * n ln(gsd ln(gm) ± t1- α, n-1 * n for 2-sided 95% confidence interval t 1-α/2,n-1 = t 0.975,n-1 2 for 1-sided 95% confidence interval t 1-α,n-1 = t 0.95,n-1 Warren Myers 2001-2005, Revised by Guffey, 2006 1

Example #1 confidence interval Given a GM of 24 a GSD of 2.7 and a sample size of 17, determine a 2-side, 95% confidence interval on the GM. 3 Example #1 solution CL s ln(gsd * ln(gm) * n ln x xln x ± t1- α, n-1 ± t1- α, n-1 1- = exp n = exp α ln( 2.7 ln(24) ± * 3.178 ± 2.12 * 0.241 t.95, 16 = exp 17 = exp ( ( )) ( 3.178 2.12 *(.241)) LCL95% = exp = 14.4 ( 3.178 + 2.12 * ( 0.241) ) UCL95% = exp = 40 4 We are 95% confident the true population mean is between 14.4 and 40. Warren Myers 2001-2005, Revised by Guffey, 2006 2

95%, 95% Tolerance Limits on Sample Distribution Normal Distribution TL = x ± s 0.95,0.95 Kγ,P,n Log-Normal Distribution TL0.95,0.95 = exp = exp ( x ± Kγ, P, n * sln x) ln x ( ln( GM ) ± Kγ, P, n * ln( GSD) ) 5 Where: γ is the probability that at least a proportion P or the population occurs Example #2 tolerance limits Determine an upper 95%, 95% tolerance limit on a distribution of exposures defined with a GM of 24 a GSD of 2.7 and a sample size of 17. 6 Warren Myers 2001-2005, Revised by Guffey, 2006 3

Example #2 solution UTL0.95,0.95 = exp = exp ( ln( GM ) + K0.95,0. 95, 17 * ln( GSD) ) ( ln(24) + 2.486 * ln(2.7) ) ( 5.647) = exp = 284 7 We are 95% confident that 95% of the exposures in the distribution will be less than 284. Exceedance Fraction For normally or lognormally distributed data, an estimate of the fraction (F) of future exposures (sometimes called the exceedence fraction) that exceed a particular level can be calculated as: normal lognorm OEL x) Exc. Fract.= P( c > OEL) = P Z > s ln( OEL) ln( GM ) Exc. Fract.= P( c > OEL) = P Z > ln( GSD) 8 Probability that a future exposure measurement made from the same sample group will exceed your chosen exposure limit is percent. Warren Myers 2001-2005, Revised by Guffey, 2006 4

Example #3 exceedance Given an OEL of 35, a GM =24, GSD = 2.7, what is the probability that a future exposure measurement made from the same sample group will exceed the exposure limit. 9 Example #3 solution F = P(c > OEL) = P z > ln(oel)-ln(gm) ln(gsd) ( ) ( ) ln( 2.7 ) ln 35 - ln 24 = P z > = PZ ( > 0.38) = 1- P(z < 0.38) = 1-0.648 = 35.2% 10 The probability that a future exposure measurement made from this same sample group will exceed 40 is 30.5% Warren Myers 2001-2005, Revised by Guffey, 2006 5

Example #4 exceedance Given: OEL of 35 GM =24 GSD = 2.7 What would you need to have your GM value to be if you wanted to be 80% sure that a future exposure measurement made from the same sample group will not exceed the exposure limit. (or have an exceedance fraction of than 20%) 11 Solution #4 Z p = ln(oel) ln( GM ) ln(gsd) ln( GM ) = ln(oel) Z ln(gsd) = ln(35) 0.842 ln(2.7) = 3.555 0.8363 = 2.7187 p 12 2.7187 GM = e = 15 Warren Myers 2001-2005, Revised by Guffey, 2006 6

Hypothesis Used to choose between two alternatives (in regression we test all the time whether the slope of the regression line is equal to 0 or not) We begin by stating the problem and then stating the null hypothesis and the alternative hypothesis Null hypothesis states that the results we observe are due to chance Null hypothesis always that variable is actually constant Alternative hypothesis states is unlikely to be true unless the null hypothesis is likely to be false. 13 Hypothesis testing Calculate a test statistic (choice of statistic depends on the sample size and on the distribution of the statistic, we saw that sample averages are distributed normal) Find the P-value associated with the test statistics 14 Warren Myers 2001-2005, Revised by Guffey, 2006 7

Hypothesis testing - Significance Level Significance level is the threshold at which we are confident to reject Ho or we fail to reject the Ho. Choice of the significance level depends on the problem. Typically: 5% is called statistically significant, 1% - highly significant 15 Type I versus Type II error: Statistical Decision True state of null Reject null Do not reject null True Type I error Correct Not true Correct Type II error 16 Warren Myers 2001-2005, Revised by Guffey, 2006 8

t-tests 17 For testing hypothesis involving two means (>, < or different from a given pop mean) Use T-test when: significance of the mean(s) the number of subjects in the sample is less than 30 continuous data true standard deviation of the population is unknown Hypothesis construction for Ho: x = u Ha: x > u Ha: x < u Ha: x <> u Determine level of significance α = 0.05 α = 0.01 Difference in means; assumed to be equal variance t = S p X a X b 1 1 + n n a b S p = ( 1) + ( 1) s n s n 2 2 a a b b n + n 2 a b 18 df = n a + n b 2 Where: df = degrees of freedom X = observed mean of each group sample s 2 = observed variance of each sample S p = pooled estimate of the standard deviation n = number of samples Warren Myers 2001-2005, Revised by Guffey, 2006 9

Decision rule If T > t α, n-1 null hypothesis is rejected alternative is accepted. P(error if reject) = α 19 t-test example 20 Warren Myers 2001-2005, Revised by Guffey, 2006 10

z-tests 21 For testing hypothesis involving two means (>, <, <>) Test assumptions n > 25 normal distribution standard deviation of the pop is known Hypothesis construction for Ho: x = u Ha: x > u Ha: x < u Ha: x <> u Determine level of significance α = 0.05 α = 0.01 Test formula z = X a σ n b 2 2 a b a X σ + n b Where: X = observed mean of each group σ 2 = true variance of each group n = number of samples 22 Warren Myers 2001-2005, Revised by Guffey, 2006 11

Decision rule If Z > Z α null hypothesis is rejected alternative is accepted. The probability of incorrectly rejecting the null hypothesis (Type I error) because of the results is equal α 23 z-test example 24 Warren Myers 2001-2005, Revised by Guffey, 2006 12

25 Warren Myers 2001-2005, Revised by Guffey, 2006 13