The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80

Similar documents
Chapter 9: Hypothesis Testing Sections

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

Chapter 6. Hypothesis Tests Lecture 20: UMP tests and Neyman-Pearson lemma

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

40.530: Statistics. Professor Chen Zehua. Singapore University of Design and Technology

TUTORIAL 8 SOLUTIONS #

Hypothesis Testing Chap 10p460

STATISTICS SYLLABUS UNIT I

Institute of Actuaries of India

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

STA 732: Inference. Notes 2. Neyman-Pearsonian Classical Hypothesis Testing B&D 4

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided

STAT 830 Hypothesis Testing

STAT 830 Hypothesis Testing

Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

Lecture 21. Hypothesis Testing II

Lecture 12 November 3

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

Subject CS1 Actuarial Statistics 1 Core Principles

Methods of evaluating tests

Derivation of Monotone Likelihood Ratio Using Two Sided Uniformly Normal Distribution Techniques

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

Testing Statistical Hypotheses

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Summary of Chapters 7-9

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45

Homework 7: Solutions. P3.1 from Lehmann, Romano, Testing Statistical Hypotheses.

Math 494: Mathematical Statistics

Hypothesis Testing. Testing Hypotheses MIT Dr. Kempthorne. Spring MIT Testing Hypotheses

14.30 Introduction to Statistical Methods in Economics Spring 2009

Chapters 10. Hypothesis Testing

Lecture 8: Information Theory and Statistics

Detection Theory. Composite tests

Charles Geyer University of Minnesota. joint work with. Glen Meeden University of Minnesota.

Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution.

Chapters 10. Hypothesis Testing

Stat 5102 Final Exam May 14, 2015

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

STAT 801: Mathematical Statistics. Hypothesis Testing

Direction: This test is worth 250 points and each problem worth points. DO ANY SIX

1. (Regular) Exponential Family

j=1 π j = 1. Let X j be the number

Chapter 4. Theory of Tests. 4.1 Introduction

Hypothesis Testing - Frequentist

On the GLR and UMP tests in the family with support dependent on the parameter

simple if it completely specifies the density of x

DA Freedman Notes on the MLE Fall 2003

A Very Brief Summary of Statistical Inference, and Examples

Practice Problems Section Problems

Institute of Actuaries of India

Probability Distributions Columns (a) through (d)

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

ECE531 Screencast 11.4: Composite Neyman-Pearson Hypothesis Testing

Hypothesis Testing One Sample Tests

Mathematical statistics

Exercises Chapter 4 Statistical Hypothesis Testing

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES

Testing Statistical Hypotheses

8: Hypothesis Testing

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

Master s Written Examination - Solution

Some General Types of Tests


Lecture 8: Information Theory and Statistics

Review. December 4 th, Review

HANDBOOK OF APPLICABLE MATHEMATICS

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

STAT 461/561- Assignments, Year 2015

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Spring 2012 Math 541B Exam 1

Reading Material for Students

Composite Hypotheses and Generalized Likelihood Ratio Tests

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Tests and Their Power

Chapter 6 Expectation and Conditional Expectation. Lectures Definition 6.1. Two random variables defined on a probability space are said to be

Non-parametric Inference and Resampling

14.30 Introduction to Statistical Methods in Economics Spring 2009

Ch. 5 Hypothesis Testing

Kumaun University Nainital

F79SM STATISTICAL METHODS

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

A Course in Statistical Theory

BTRY 4090: Spring 2009 Theory of Statistics

A General Overview of Parametric Estimation and Inference Techniques.

Dr. Maddah ENMG 617 EM Statistics 10/15/12. Nonparametric Statistics (2) (Goodness of fit tests)

Math 494: Mathematical Statistics

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

BEST TESTS. Abstract. We will discuss the Neymann-Pearson theorem and certain best test where the power function is optimized.

Mathematical Statistics

Theory of Probability Sir Harold Jeffreys Table of Contents

Master s Written Examination

Lecture 7 Introduction to Statistical Decision Theory

Statistics 135 Fall 2008 Final Exam

Transcription:

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80 71. Decide in each case whether the hypothesis is simple [ S ] or composite[ C ]: [ S ] The hypothesis that a random variable has a gamma distribution with α=3 and β=2. [ C ] The hypothesis that a random variable has a gamma distribution with α=3 and β 2. [ C ] The hypothesis that a random variable has an exponential density. [ C ] The hypothesis that a random variable has a beta distribution with the mean μ=0.50. [ S ] The hypothesis that a random variable has a Poisson distribution with λ=1.25. [ C ] The hypothesis that a random variable has a normal distribution with λ>1.25. [ C ] The hypothesis that a random variable has a negative binomial distribution with k=3 and θ<0.60. 72. Fill in the blanks. (a) A single observation of a random variable having a hypergeometric distribution with and is used to test the null hypothesis against the alternative hypothesis. The null hypothesis is rejected if and only if the value of the random variable is 2. The sample space of the test statistic is { }. On the sample space of the test statistic, the critical region is { }. The power function is ( ) probability is ( ). ( )( ) ( ) ( ), the Type 1 error probability is ( ), the Type 2 error (b) A single observation of a random variable having a geometric distribution (p.m.f. ( ) ) is used to test the null hypothesis against the alternative hypothesis. The null hypothesis is rejected if and only if the observed value of the random variable is greater than or equal to the positive integer. The sample space of the test statistic is { }. On the sample space of the test statistic, the critical region is { }. The power function is ( ) ( ), the Type 1 error probability is, the Type 2 error probability is.

(c) A single observation of a random variable having an exponential distribution is used to test the null hypothesis that the mean of the distribution is against the alternative that it is. The null hypothesis is accepted if and only if the observed value of the random variable is less than 3. The sample space of the test statistic is ( ). On the sample space of the test statistic, the critical region is [ ). The power function is ( ), the Type 1 error probability is 0.2231, the Type 2 error probability is 0.4512. (d) A single observation of a random variable having a uniform density with is used to test the null hypothesis against the alternative hypothesis. The null hypothesis is rejected if and only if the random variable takes on a value greater than. The sample space of the test statistic is [ ). On the sample space of the test statistic, the critical region is ( ). The power function is ( ) ( ), the Type 1 error probability is 0, the Type 2 error probability is. 73. Filling the blanks. (a) Let and constitute a random sample from a normal population with. If the null hypothesis is to be rejected in favor of the alternative hypothesis when, then the size of the test is ( ). (b) Let and constitute a random sample of size 2 from the population given by ( ) ( ). If the critical region is used to test the null hypothesis against the alternative hypothesis, then the power of this test at is ( ) ( ) ( ) ( ) [ ]. 74. With i.i.d. data { }

we would like to test hypotheses about a parameter of the statistical distribution of : Natrually, we may directly compare their likelihoods: via the likelihood ratio ( { } ) ( { } ) whose only variable argument is the data vector { } as both and are fixed constants. This means the likelihood ratio is a statistic. We use it as the test statistic to specify the following critical region { } where is a constant to be specified according to the size of the test, that is, ( ) Suppose a different test for the same pair of hypotheses uses the critical region with the same size, that is, ( ) The power of the -test at is ( ) ( ) The power of the -test at is ( ) ( ) Show that ( ) ( ). Proof ( ) ( ). End of Proof. Therefore, any simple-vs-simple test based on { } is the most powerful test among all tests with size. 75. Explain why a statistical hypothesis test is identified by its critical region (on the data s joint sample space), not by its test statistic. Explain why the power function characterizes a statistical hypothesis test. Answer. It is clear that underlying the data is the joint sample space of iid random variables and their joint distribution. Any statistic defined on the data, hence taken the data s joint sample space as domain, will only bring about a simplified sample space (the range of the statistic) and a sampling distribution thereon (on the range)

commeasurable with the data s joint distribution (on the domain of the statistic). Any critical region specified on the statistic s range automatically specifies an unambiguous critical region on the data s joint sample space (the domain the statistic) because the latter is the pre-image of the former under the statistic. As the null hypothesis is judged on the range of the statistic by observing whether the realization of the statistic falls in the critical region here, an equivalent judgment is simultaneously carried out on the joint sample space of the data by observing whether the realization of the data vector falls in the pre-image critical region. An important difference between the two critical regions is concerned with their sophistication: the data s joint sample space is at least as sophisticated as any statistic s range. Therefore, not all critical regions on the data s joint sample space is identifiable from the statistic s range some sophisticated structure is just lost in the statistic s simplification effort. In the extreme case, a barbarous statistic may manipulate the distribution on its range by concentrate those low-density regions of the domain to artificially create a high density region on the range and spread those high-density regions widely on its domain to artificially create a vast low-density region on its range. For this question, we note that the judgment of the null hypothesis is equivalent to the indication of the critical region, regardless of whether the judgment is carried out on the statistic s range or directly on the data s joint sample space. And since all possible critical regions on the range of any statistic is always identifiable on the data s joint sample space, whereas the converse is not true, we prefer to say the test is identifiable from the critical region on the data s joint sample space, and this critical region exists independently of the construction of any statistic. The power function gives the probability measure of the critical region for each parameter in the parameter space. It describes the interaction of the test (critical region) and the parameter using a probability, which can be interpreted as the likelihood of the parameter given the indicator function of the critical region (as a binary random variable). This likelihood holds all information about the parameter given the test s critical region. 76. A size- random sample from a normal population with is to be used to test the null hypothesis against the alternative hypothesis, where. Show that the most powerful size- critical region does not depend on the value of. Solution. This is a simple-vs-simple test. The Neyman-Pearson Lemma is in effect. The null likelihood is. Likelihood ratio of Null-over-Alternative: ; The alternative likelihood is [( ) ( ) ] [( )( )] ( ) ( ) ( ) ( ) ( )

where are constants linked to the size of the test. Thus if the sample mean is used as a test statistic, the size- critical region on the range of is the half-line { }. (for the normal upper α-quantile, e.g.,.) Clearly this does not depend on and this is the most powerful critical region by the Neyman-Pearson Lemma. The independence of means that the critical region is the uniformly most powerful if the alternative hypothesis has been changed to. 77. A size- random sample from an exponential population (use the density form where the parameter is reciprocal of the mean) is used to test the null hypothesis against the alternative hypothesis. Construct the most powerful size- critical region for this pair of hypotheses and argue that it is the uniformly most powerful critical region when the alternative hypothesis has been changed to the composite hypothesis. Solution. This is a simple-vs-simple test. The Neyman-Pearson Lemma is in effect. ;. ( ) where are constants linked to the size of the test. The Gamma distribution result is due to the fact that iid sum of exponentials has a Gamma distribution with and the exponential s mean (= here). Thus if the sample sum is used as a test statistic, the size- critical region on the range of is the half-line {}, where ( ) is the upper-α quantile of the distribution. Clearly this does not depend on and this is the most powerful critical region by the Neyman-Pearson Lemma. The independence of means that the critical region is the uniformly most powerful if the alternative hypothesis has been changed to. 78. A size- random sample from a normal population with is used to test the null hypothesis against the alternative hypothesis. Find the uniformly most powerful size- critical region. Solution. We start with the simple alternative hypothesis so that the Neyman-Pearson is in effect. Then we only need to show that the MP critical region is independent of meaning the test of the original simpe-vs-composite hypotheses is UMP over. Now test vs : ;. where are constants linked to the size of the test. Thus, if the sum of squared and standardized sample is used as the test statistic, the size- critical region on the range of is the half-line { } where ( ) is the upper α-

quantile of the distribution with degrees of freedom. The critical region does not depend on meaning the MP critical region for the particular simple-vs-simple hypotheses is the UMP critical region for our original simple-vs-composite hypotheses. 79. When we test a simple null hypothesis against a composite alternative, a critical region is said to be unbiased if the corresponding power function takes on its minimum value at the value of the parameter assumed under the null hypothesis. In other words, a critical region is unbiased if the probability of rejecting the null hypothesis is least when the null hypothesis is true. Given a single observation of the random variable having the density ( ) ( ) { ( )} where, show that the critical region provides an unbiased critical region of size for testing the null hypothesis against the alternative hypothesis. Solution. The power function is ( ) ( ) ( ) ( ) which attains the minimum value iff. Therefore the critical region ( ] is unbiased. 80. A size- random sample from a normal population with unknown mean and variance is to be used to test the null hypothesis against the alternative. Show that the (generalized) likelihood ratio statistic can be written in the form ( ) where. Then show that. Solution. The generalized likelihood ratio is generalized from the simple likelihood ratio for testing composite-vs-composite hypotheses. The GLR is defined as where is the null likelihood as usual, while is the unconstrained likelihood as we use in an unconstrained estimation problem. Unfortunately, the Neyman-Pearson Lemma is dropped during the generalization. The Wilk s Theorem gives the asymptotic distribution of a slight 1-1 transformation on : ( ) where is the degrees of freedom of the full parameter space and is that of the null space. But here we do not cite the Wilk s theorem.

[ ] [ ( ) ] [ ( ) ] ( ( ) ) Fact: ( ) ( ) ( ) Therefore and ( ( ) ( ) ( ) ) ( ( ) ( ) ) ( ) ( )