Solution: First note that the power function of the test is given as follows,

Similar documents
MATH 728 Homework 3. Oleksandr Pavlenko

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Comparing two independent samples

Summary of Chapters 7-9

Topic 15: Simple Hypotheses

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

1 Statistical inference for a population mean

The Components of a Statistical Hypothesis Testing Problem

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

EXAM 3 Math 1342 Elementary Statistics 6-7

Review. December 4 th, Review

Tests about a population mean

TUTORIAL 8 SOLUTIONS #

Introduction to Statistics

Statistical Inference

Section 6.2 Hypothesis Testing

Tables Table A Table B Table C Table D Table E 675

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

1 Hypothesis testing for a single mean

Chapters 10. Hypothesis Testing

Visual interpretation with normal approximation

Institute of Actuaries of India

This paper is not to be removed from the Examination Halls

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

Lecture 10: Comparing two populations: proportions

Topic 19 Extensions on the Likelihood Ratio

Performance Evaluation and Comparison

Chapter 8 of Devore , H 1 :

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Chapter 9: Hypothesis Testing Sections

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Introductory Econometrics

Statistical Hypothesis Testing

Lecture 13: p-values and union intersection tests

40.530: Statistics. Professor Chen Zehua. Singapore University of Design and Technology

CH.9 Tests of Hypotheses for a Single Sample

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Tests for Population Proportion(s)

Rules for Means and Variances; Prediction

8.1-4 Test of Hypotheses Based on a Single Sample

Math 562 Homework 1 August 29, 2006 Dr. Ron Sahoo

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

F79SM STATISTICAL METHODS

Chapters 10. Hypothesis Testing

, 0 x < 2. a. Find the probability that the text is checked out for more than half an hour but less than an hour. = (1/2)2

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

STAT100 Elementary Statistics and Probability

Chapter 7 Comparison of two independent samples

Topic 10: Hypothesis Testing

Statistical Inference

Charles Geyer University of Minnesota. joint work with. Glen Meeden University of Minnesota.

Chapter 10: Inferences based on two samples

Qualifying Exam in Probability and Statistics.

Chapter 7. Hypothesis Testing

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Topic 10: Hypothesis Testing

Paired comparisons. We assume that

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Math 152. Rumbos Fall Solutions to Assignment #12

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

Two Sample Hypothesis Tests

Mathematical statistics

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n

MTMS Mathematical Statistics

Smoking Habits. Moderate Smokers Heavy Smokers Total. Hypertension No Hypertension Total

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Math 494: Mathematical Statistics

Statistics. Statistics

Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

An interval estimator of a parameter θ is of the form θl < θ < θu at a

STAT 4385 Topic 01: Introduction & Review

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings

Chapter 9. Hypothesis testing. 9.1 Introduction

Chapter 3. Comparing two populations

Math Review Sheet, Fall 2008

Comparison of Two Samples

Sociology 6Z03 Review II

Hypothesis Testing Chap 10p460

Loglikelihood and Confidence Intervals

Chapter 14 Simple Linear Regression (A)

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

Ling 289 Contingency Table Statistics

Basic Concepts of Inference

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

6.4 Type I and Type II Errors

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Central Limit Theorem ( 5.3)

Probability and Statistics Notes

Transcription:

Problem 4.5.8: Assume the life of a tire given by X is distributed N(θ, 5000 ) Past experience indicates that θ = 30000. The manufacturere claims the tires made by a new process have mean θ > 30000. Is it possible that θ = 35000? Tets this claim by testing H 0 : θ = 30000 against H : θ > 30000. We observe n independent values denoted x, x,..., x n and we reject H 0 iff x c. Determine n and c such that the power function of the test, γ(θ) has the values γ(30000) = 0.0 and γ(35000) = 0.98 Solution: First note that the power function of the test is given as follows, Now because x θ 5000/n γ(θ) = P ( x c µ = θ, σ = 5000) ( x θ = P 5000/ n c θ ) 5000/ n N(0, ), it follows that in order to satisfy the given conditions, c 30000 5000/ n = z 0.0 and c 35000 5000/ n = z 0.98 All that remains is to substitute in the appropriate z values and solve the system of equations as follows, c 30000 5000/ n =.36 c 30000 = 5000(.36) n c = 630 n + 30000 () c 35000 5000/ n =.054 c 35000 = 5000(.054) n c = 070 n + 35000 () Setting () equal to () yeilds, 070 n + 35000 = 630 n + 30000 5000 = 900 n n = 900 5000 n = (4.38) = 9.844 (3) In order to be conservative rounding (3) to 0 and substituting into () yields, c = 630 0 + 30000 = 3600.547 So as a conservative decision, n = 0 and c = 3600.547

Problem 4.5.9: let X have a Poisson distribution with mean θ. Consider the simple hypothesis H 0 : θ = / and the alternative composite hypothesis H : θ < / Thus, Ω = {θ : 0 < θ /}. Let X, X,..., X be a random sample from this distribution. We reject H 0 iff the observed value of Y = X +X + +X. If γ(θ) is the power function of the test, find the powers γ(/), γ(/3), γ(/4), γ(/6), and γ(/). Sketch the graph of γ(θ). What is the significance level of the test? Solution: First note that because Y is a sum of i.i.d poisson random variables Y Poisson(θ) and γ(θ) is given as follows, γ(θ) = P (Y µ = θ) = e θ (θ) i i=0 i! = e θ + e θ (θ) + e θ (θ) The computed values of γ(θ) for desired values of θ are summarized below: γ(θ) The graph with these values shown is below: θ / /3 /4 /6 / γ(θ) 5 3 7 5 5 e 6 e 4 e 3 e e 0.06 = α θ Finally the significance of the test is given by the probability of rejecting H 0 when it is true as expressed by, P (Y θ = /) = 0.06

Problem 4.5.0: Let Y have a binomial distribution with parameters n and p. We reject H 0 : p = / and accept H : p > / if Y c. Find n and c to give a power function γ(p) such that γ(/) = 0.0 and γ(/3) =.95, approximately. Solution: First note the power function is given by, γ(p) = P (Y c p = ˆp) = P (Y < c p = ˆp) = c k=0 ( ) n p k ( p) n k k However, because this is computationally difficult to solve, we instead use the normal approximation to the binomial as follows, γ(p) = P (Y c p = ˆp) P (X c) where X N(np, np( p)) ( ) X np = P c np np( p) np( p) Because X np N(0, ) it follows that in order to satisfy the conditions, np( p) c n(0.5) n(0.5) = z 0.0 and c n(/3) n(/9) = z 0.95 All that remains is to substitute in the appropriate z values and solve the system of equations as follows, c n(0.5) n(0.5) = z 0.0 c n(0.5) n(0.5) =.8 c n(/3) n(/9) = z 0.95 c n(/3) n(/9) =.645 c 0.5n =.8 n(0.5) c =.8 n(0.5) + 0.5n (4) c n(/3) =.645 n(/9) c =.645 n(/9) + n(/3) (5) 3

Setting (4) equal to (5) yields,.645 n(/9) + n(/3) =.8 n(0.5) + 0.5n n [(/3) (/)] = n[.8 0.5 +.645 /9] n =.8 0.5 +.645 /9 (/3) (/) n = (8.50) = 7.5 (6) In order to be conservative rounding (6) to 73 and substituting into (4) yields, c =.8 73(0.5) + 0.5(73) = 4.98 Rounding to 4 in order to be conservative gives n 73 and c 4 Problem 4.5.3: Let p denote the probability that, for a particular tennis player, the first serve is good. Since p=0.40, this player decided to take lessons in order to increase p. When the lessons are completed, the hypothesis H 0 : p = 0.40 is tested against H : p > 0.40 based on n = 5 trials. Let y equal the number of first serves that are good, and let the critical region be defined by C = {y : y 3}. Solve (a) and (b). Solution: (a) Determine α = P (Y 3 p = 0.40). This follows directly from the cumulative distribution function of Y, P (Y 3 p = 0.40) = P (Y < 3 p = 0.40) ( ) 5 = (0.4) k (0.6) 5 k k k=0 = 0.846 = 0.54 (b) Find β = P (Y < 3 p = 0.60) so that β is the power at p = 0.60. This similarly follows directly from the cumulative distribution function of Y, ( ) 5 P (Y < 3 p = 0.60) = (0.6) k (0.4) 5 k = 0.54 k k=0 β = 0.846 4

Problem 4.6.4: Consider the one-sided t-test for H 0 : µ = µ 0 versus H A : µ > µ 0 constructed in Example 4.5.4 and the two-sided t-test for H 0 : µ = µ 0 versus H : µ µ 0 given in (4.6.9). Assume that both tests are of size α. Show that for µ > µ 0, the power function of the one-sided test is larger than the power function of the two-sided test. Solution: This follows from direct comparison of the power functions as follows. First denote the power function for the first test by γ (µ) and denote the power function of the second test by γ (µ). The power functions then follow from their definitions. ( ) x µ0 γ (µ) = P σ/ n t α(n ) µ ( ) ( x µ0 = P σ/ n t α/(n ) µ + P t α/ > x µ ) 0 σ/ n t α(n ) µ ( ) x µ 0 γ (µ) = P σ/ n t α/(n ) µ ( ) ( ) x µ0 x = P σ/ n t µ0 α/(n ) µ + P σ/ n t α/(n ) µ Therefore, for µ > µ 0, γ (µ) > γ (µ) iff, ( ) x µ0 P σ/ n t α/(n ) µ < P ( t α/ > x µ ) 0 σ/ n t α(n ) µ If the null is true and µ 0 = µ then it follows that, ( ) x µ0 P σ/ n t α/(n ) µ = α ( = P t α/ > x µ ) 0 σ/ n t α(n ) µ However, if µ > µ 0, because the true distribution is symmetric about µ but the t-values are symetric around µ 0, it follows that the interval on the RHS is closer to the true mean while the interval on the LHS is further away from the true mean. Based on the symmetry of the distribution about µ the desired inequality follows. 5

Problem 4.6.5: Assume that the weight of cereal in a 0-ounce box is N(µ, σ ). To test H 0 : µ = 0. against H : µ > 0., we take a random sample of size n = 6 and observe that x = 0.4 and s = 0.4. Solve (a) and (b). Solution: (a) Do we accept or reject H 0 at the 5% significance level? Because σ is unknown and the sample size is small, we utilize a t test. Therefore, we reject H 0 iff T = x µ 0 s/ n t 0.05(n ) From the observed values, T = 0.4 0. 0.4/4 Therefore we reject H 0 in favor of H (b) What is the approximate p-value of this test? By definition the p-value of the test is given by, = 3 and t 0.05 (5) =.753 p = P (T 3) Because T T (5) it follows from the table that, p = P (T 3) 0.005 Problem 4.6.7: Among the data collected for the WHO air quality monitoring project is a measure of suspended particles in µg/m 3. Let X and Y equal the concentration of suspended particles in µg/m 3 in the city center for Melbourne and Houston respectively. Using n = 3 observations of X and m = 6 observations of Y, we test H 0 : µ X = µ Y against H : µ X < µ Y. Solve (a) and (b). Solution: (a) Define the test statistic and critical region, assuming the unknown variances are equal. Let α = 0.05 Because the sample is small and the variances are unknown but assumed to be the same, we use a t-test with a pooled variance estimator, which is denoted Sp, where we reject iff T = (ȳ x) (µ 0,Y µ 0,X ) (n )SX t α (n + m ) where S p = + (m )S Y S n + m p m + n Which simplifies to: T = ȳ x S p + 6 3.703 where S p = 6 ()S X + (5)S Y 7

(b) If x = 7.9, s x = 5.6, ȳ = 8.7, and s y = 8.3, calculate the value of the test statistic and state your conclusions. This follows from simply substituting in the observed values. First S p is given as follows, ()(5.6) + (5)(8.3) S p = = 7.33 7 Then substituting this value yields, T = 8.7 7.9 7.33 Therefore we do not reject the null, H 0 + 6 3 = 0.868.703 Problem 4.6.8: Let p equal the proportion of drivers who use a seatbelt in a country that does not have a mandatory seat belt law. It was claimed that p=0.4. An advertising campaign was conducted to increase this proportion. Two months after the campaign, y=04 out of the random sample of n=590 drivers were wearing their seat belts. Was the campaign successful? Answer (a)-(c) Solution: (a) Define the null and alternative hypotheses. First denote p 0 = 0.4 and p = Y/n. Because we want to be conservative, we assume nothing has changed letting H 0 : p 0 = p be the null hypotheses. Then because we want to test only if there was improvement we let H : p 0 < p be the alternative hypothesis. (b) Define a critical region with an α = 0.0 significance level. Because p is the sample mean of a series of Bernoulli random variables we can use a Z test where we reject H 0 if and only if, Z = p p 0 p( p) n z 0.0 or with desired values: Z = p p 0 p( p) 590 (c) Determine the approximate p-value and state your conclusion First substituting observed values from the data yields, Z = 04/590 0.4 (04/590)( 04/590) 590 = 0.0363 0.057 =.3.36 7

Because.3 <.36 we fail to reject H 0 : p 0 = p at the significance level α = 0.0. Given this observed Z test statistic the p-value can be found. Because Z N(0, ), by definition the p-value of the test is given by: p value = P (Z.3) 0.9896 = 0.004 Problem 4.6.9: In Exercise 4..8 we found a confidence interval for the variance σ using the variance S of a random sample n arising from N(µ, σ ), where the mean is unknown. In testing H 0 : σ = σ 0 against H : σ > σ 0, use the critical region defined by (n )S /σ 0 c. That is, reject H 0 in favor of H if S cσ 0/(n ). If n = 3 and the significance level α = 0.05, determine c. Solution: First note that given the desired values, we are seeking to find is c such that: ( ()S P σ 0 ) c = 0.05 Because it is known that S /σ 0 χ () all that remains is to utilize a table to find the appropriate value. It follows that c = 3.337 Problem 4.6.0: In Exercise 4..7, in finding a confidence interval for the ratio of the variances of two normal distributions, we used a statistic S /S, which has an F-distribution when those two variances are equal. If we denote the statistic by F, we can test H 0 : σ = σ against H : σ > σ using the critical region F c. If n = 3, m =, and α = 0.05, find c. Solution: First note that given the desired values we are seeking to find c such that, ( S P S ) c = 0.05 Because it is known that S /S F (0, ) all that remains is to utilize a table to find the appropriate value. It follows that c =.75. 8