Day 8-3. Prakash Balachandran Department of Mathematics & Statistics Boston University. Friday, October 28, 2011

Similar documents
Common Large/Small Sample Tests 1/55

Chapter 13, Part A Analysis of Variance and Experimental Design

LESSON 20: HYPOTHESIS TESTING

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

Chapter 22: What is a Test of Significance?

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

1036: Probability & Statistics

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Math 140 Introductory Statistics

Topic 9: Sampling Distributions of Estimators

Sampling Distributions, Z-Tests, Power

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

University of California, Los Angeles Department of Statistics. Hypothesis testing

Last Lecture. Wald Test

Topic 9: Sampling Distributions of Estimators

Frequentist Inference

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Power and Type II Error

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Data Analysis and Statistical Methods Statistics 651

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Topic 9: Sampling Distributions of Estimators

Chapter 6 Sampling Distributions

1 Inferential Methods for Correlation and Regression Analysis

Topic 18: Composite Hypotheses

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Chapter two: Hypothesis testing

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Estimation of a population proportion March 23,

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

MA238 Assignment 4 Solutions (part a)

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Problem Set 4 Due Oct, 12

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

1 Constructing and Interpreting a Confidence Interval

A statistical method to determine sample size to estimate characteristic value of soil parameters

Final Examination Solutions 17/6/2010

Sample Size Determination (Two or More Samples)

1 Constructing and Interpreting a Confidence Interval

Chapter 1 (Definitions)

Module 1 Fundamentals in statistics

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

This is an introductory course in Analysis of Variance and Design of Experiments.

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

5. Likelihood Ratio Tests

A Confidence Interval for μ

Chapter 23: Inferences About Means

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

Successful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile

Lesson 2. Projects and Hand-ins. Hypothesis testing Chaptre 3. { } x=172.0 = 3.67

Parameter, Statistic and Random Samples

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Expectation and Variance of a random variable

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Statistical inference: example 1. Inferential Statistics

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Chapter 5: Hypothesis testing

Properties and Hypothesis Testing

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Economics Spring 2015

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Comparing your lab results with the others by one-way ANOVA

Estimation for Complete Data

ORF 245 Fundamentals of Engineering Statistics. Midterm Exam 2

Announcements. Unit 5: Inference for Categorical Data Lecture 1: Inference for a single proportion

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

f(x)dx = 1 and f(x) 0 for all x.

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Section 11.8: Power Series

Mathematical Notation Math Introduction to Applied Statistics

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

Stat 319 Theory of Statistics (2) Exercises

z is the upper tail critical value from the normal distribution

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

A Statistical hypothesis is a conjecture about a population parameter. This conjecture may or may not be true. The null hypothesis, symbolized by H

STAC51: Categorical data Analysis

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Statistics 20: Final Exam Solutions Summer Session 2007

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

The Hong Kong University of Science & Technology ISOM551 Introductory Statistics for Business Assignment 3 Suggested Solution

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Machine Learning Brett Bernstein

Transcription:

Day 8-3 Prakash Balachadra Departmet of Mathematics & Statistics Bosto Uiversity Friday, October 8, 011 Sectio 5.: Hypothesis Tests forµ Aoucemets: Tutorig: Math office i 111 Cummigto 1st floor, Rich Hall. Away week of November 13. Xiyi will fill i. Cheat sheet - 1 pg, sigle sided. Office hours ext Friday moved to ext Wedesday so you guys ca ask me questios before the exam. 3.: Usigp-values (σ kow) 1. Assume that, (a) The sample is obtaied usig simple radom samplig, (b) The sample has o outliers ad the populatio from which the sample is draw is ormally distributed or that 30.. Step 1: Determie if the test is a two, left, or right-tailed test. 3. Select a level of sigificaceαbased o the seriousess of makig a Type I error, ad compute if left or right-tailed orp ] Z > zα if two-tailed. 4. Compute 5. If, PZ > z α ] = α, Z N(0,1) z 0 = X µ 0 σ/ which is the umber of stadard errors the sample mea X is from the ull hypothesis mea µ 0. (a) (two-tailed)pz > z 0 ] < α reject the ull hypothesis. (b) (left-tailed)pz < z 0 ] < α reject the ull hypothesis. 1

(c) (right-tailed)pz > z 0 ] < α reject the ull hypothesis. Thesep-values quatify the probability that a sample will result i a sample mea such as the oe obtaied if the ull hypothesis is true. Thus, we reject the ull hypothesis if thep-value is too small (so that it s very ulikely that we got the sample mea we did assumig the ull hypothesis is true) ad so yields the exact sigificace of the data. (σ ukow) 1. Assume that, (a) The sample is obtaied usig simple radom samplig, (b) The sample has o outliers ad the populatio from which the sample is draw is ormally distributed or 30.. Step 1: Determie if the test is a two, left, or right-tailed test. 3. Select a level of sigificaceαbased o the seriousess of makig a Type I error, ad compute if left or right-tailed orp ] Z > zα if two-tailed. 4. Compute 5. If, Examples: PZ > t α ] = α, Z tdist( 1) t 0 = X µ 0 s/ is the umber of sample stadard errors away X is from µ 0. (a) (two-tailed)pz > t 0 ] < α reject the ull hypothesis. (b) (left-tailed)pz < t 0 ] < α reject the ull hypothesis. (c) (right-tailed)pz > t 0 ] < α reject the ull hypothesis. A egieer wats to measure the bias i a ph meter. She uses the meter to measure the ph i 14 atural substaces ad obtais the followig data, 7.01, 7.04, 6.97, 7.00, 6.99, 6.97, 7.04, 7.04, 7.01, 7.00, 6.99, 7.04, 7.07, 6.97. Is there sufficiet evidece to support the claim that the ph meter is ot correctly calibrated at α = 0.05? Solutio: The data s free of outliers ad approximately ormal, so we re good. This is a two-tailed test. µ 0 = 7.0, X = 7.01, s = 0.03, = 14. t 0 = X µ 0 s/ = 7.01 7.0 0.03/ 14 = 1.1693. So, PZ > t 0 ] = PZ > t 0 ] + PZ < t 0 ] = PZ > t 0 ] = PZ > 1.1693] = 0.1316 = 0.633.

0.8 0.75 0.7 0.65 0.6 0.55 6.97 6.98 6.99 7 7.01 7.0 7.03 7.04 7.05 7.06 7.07 7.07 7.06 7.05 7.04 7.03 7.0 7.01 7 6.99 6.98 6.97 1 3

Sice 0.633 > 0.05, we caot reject the ull hypothesis, so that with a 5% level of sigificace, the ph meters are calibrated correctly. I this example, we kow the exact level of sigificace i the data, amely 35%. This is the smallest level of sigificace we eed i order to still reject H 0. 3.3: Aalysis ad Coclusios of the Hypothesis Tests (Power) Whe performig hypothesis testig, we cotrolαby prescribig it outright, where recall that, α = PType I Error] = PReject H 0 H 0 True]. β = PType II Error] = PAccept H 0 H 1 True]. β o the other had is difficult to cotrol sice it depeds o several factors. Oe of these parameters isαitself, β = PH 0 True H 1 True] = PH 1 True H 0 True] PH 0 True] PH 1 True] = (1 α) PH 0 True] PH 1 True] Thus, icreasigαdecreases β, so that oe must weight the choice of a smallerβ, which is desirable, with a larger level of sigificace,α, which is udesirable. The power of a test is defied to be the ability of the test to detect or reject a false ull hypothesis. Quatitatively, it s defied by, Power = 1 β = PReject H 0 H 0 False] = PReject H 0 H 1 True]. As power icreases, β decreases, resultig i a better test (lower type II error). Power is a complicated fuctio of three variables: 1. =Sample Size,. α=level of Sigificace=PType I Error], 3. ES=The Effect Size=The stadardized differece i meas specified uder H 0 ad H 1 defied by, ES = µ 0 µ 1. σ As such, we ll focus oly o the case where σ is kow. Let s ivestigate the effect of each of these parameters o power. 4

Figure 1: X uderh0 with mea µ 0 = 100. Figure : X uderh0 with mea µ 0 = 100 ad H 1 withµ 1 = 110. 5

Figure 3: X uderh0 ad H 1 withα = 0.05. 6

Figure 4: Chagigαto0.1 fixigµ 1 = 110. Figure 5: Chagig µ 1 to 10 with fixig α = 0.05. All pictures from D Agostio et al. 7

The last effect is chagig the sample size, but we ve doe this before: a larger sample size esures a more powerful test. Let s compute the power= 1 β where, for ow, we re cosiderig a right-tailed test. Recall that z α is defied by, PZ > z α ] = α where Z = X 0 µ 0 σ/. Thus, ] X0 µ 0 P σ/ > z α = α P X 0 > σ ] z α +µ 0 = α ad so x α = σ z α +µ 0 i the above picture. Now, X 1 = σ Z +µ 1 so that, 1 β = P X1 > x α ] = P σ Z +µ 1 > σ z α +µ 0 ] Sice this is a right-tailed test,µ 1 > µ 0 so that 1 β = P Z > z α µ ] 1 µ 0 1 β = P Z > z α µ ] 1 µ 0 We ca repeat similar argumets for the left-tailed ad two-tailed tests, obtaiig the followig formulae for power: 1. For a two-sided test:. For a oe-sided test (left or right) Examples: Power = P Z > zα µ ] 0 µ 1 Power = P Z > z α µ ] 0 µ 1 8

(# 9) We wish to test the hypothesis that the mea weight for females who are 5 8 is 140 pouds. Assumig σ = 15 usig a 5% level of sigificace with = 36, fid the power of the test ifµ = 150 usig a two-sided test. Solutio: Sice this is two-tailed test, Power = P Z > zα µ ] 0 µ 1 µ 0 = 140,µ 1 = 150, = 36, σ = 15. zα = z 0.05 = 1.96, so that Power = P Z > 1.96 140 150 ] 15/ = PZ >.04] = 1 PZ <.04] = 1 0.007 = 0.9793. 36 (Sample Size Determiatio) Take agai the right-tailed test. We have from the computatio of power, This defies, Sicez β = z 1 β, so that, 1 β = P Z > z α µ ] 0 µ 1 z 1 β = z α µ 0 µ 1 σ/. z β = z α µ 0 µ 1 σ/ = z α ES = z α +z β ES = ( ) zα +z β. ES The same ca be doe for the left-tailed ad two-tailed tests, so that we have the followig formulae for sample-size determiatio, 1. For a two-tailed test,. For a oe-sided test, it s give by, ( z α = +z ) β. ES Example: = ( ) zα +z β. ES 9

We wish to ru the followig test:, H 0 : µ = 100 H 1 : µ 100 at α = 0.1. Ifσ = 0, how large of a sample would be required so that β = 0.04 ifµ = 10? Solutio: This is a two-tailed test. ES = µ 0 µ 1 σ = 100 10 0 = 0.1. Sice α = 0.1 ad β = 0.04, ( z α = +z ) ( ) ( ) β z0.05 +z 0.04 1.645+1.755 = = = 34. ES 0.1 0.1 Note: I looked upz 0.05 ad z 0.04 i the z-table. 10