Topic 19 Extensions on the Likelihood Ratio

Similar documents
Topic 15: Simple Hypotheses

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

Topic 17: Simple Hypotheses

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

Hypothesis Testing: The Generalized Likelihood Ratio Test

Topic 17 Simple Hypotheses

Math 152. Rumbos Fall Solutions to Assignment #12

Topic 12 Overview of Estimation

Chapter 7. Hypothesis Testing

Summary of Chapters 7-9

Topic 21 Goodness of Fit

Exercises and Answers to Chapter 1

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Topic 22 Analysis of Variance

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.

Institute of Actuaries of India

MATH5745 Multivariate Methods Lecture 07

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

Hypothesis Testing Chap 10p460

Composite Hypotheses. Topic Partitioning the Parameter Space The Power Function

Loglikelihood and Confidence Intervals

Hypothesis testing: theory and methods

MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30

2. What are the tradeoffs among different measures of error (e.g. probability of false alarm, probability of miss, etc.)?

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Statistical Inference

Likelihood Ratio tests

A Very Brief Summary of Statistical Inference, and Examples

Ch. 5 Hypothesis Testing

Lecture 15. Hypothesis testing in the linear model

Lecture 5: Likelihood ratio tests, Neyman-Pearson detectors, ROC curves, and sufficient statistics. 1 Executive summary

Central Limit Theorem ( 5.3)

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

Hypothesis Testing. A rule for making the required choice can be described in two ways: called the rejection or critical region of the test.

Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS

TUTORIAL 8 SOLUTIONS #

Direction: This test is worth 250 points and each problem worth points. DO ANY SIX

Power and sample size calculations

2.6.3 Generalized likelihood ratio tests

Primer on statistics:

Notes on the Multivariate Normal and Related Topics

F79SM STATISTICAL METHODS

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1

Mathematical statistics

Chapters 10. Hypothesis Testing

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain

Solution: First note that the power function of the test is given as follows,

Math 494: Mathematical Statistics

Topic 16 Interval Estimation

Mathematical Statistics

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

1 Statistical inference for a population mean

Lecture 10: Generalized likelihood ratio test

557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES

Quality Control Using Inferential Statistics In Weibull Based Reliability Analyses S. F. Duffy 1 and A. Parikh 2

simple if it completely specifies the density of x

7 Estimation. 7.1 Population and Sample (P.91-92)

BEST TESTS. Abstract. We will discuss the Neymann-Pearson theorem and certain best test where the power function is optimized.

Composite Hypotheses and Generalized Likelihood Ratio Tests

Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution.

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

Statistics 135 Fall 2008 Final Exam

Power and Sample Size Bios 662

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test.

Outline of GLMs. Definitions

Hypothesis Testing - Frequentist

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Parametric Techniques Lecture 3

Chapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests

Power and the computation of sample size

Asymptotic Tests and Likelihood Ratio Tests

Parametric Techniques

A Likelihood Ratio Test

Statistics 3858 : Maximum Likelihood Estimators

Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Testing. Confidence intervals on mean. CL = x ± t * CL1- = exp

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

Topic 16 Interval Estimation

10. Composite Hypothesis Testing. ECE 830, Spring 2014

A Very Brief Summary of Statistical Inference, and Examples

Statistics and econometrics

Tables Table A Table B Table C Table D Table E 675

Mathematical statistics

Some General Types of Tests

Chapter 8 - Statistical intervals for a single sample

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Detection theory. H 0 : x[n] = w[n]

If there exists a threshold k 0 such that. then we can take k = k 0 γ =0 and achieve a test of size α. c 2004 by Mark R. Bell,

Regression Estimation - Least Squares and Maximum Likelihood. Dr. Frank Wood

Hypothesis Testing One Sample Tests

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided

STAT 514 Solutions to Assignment #6

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

This paper is not to be removed from the Examination Halls

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

Topic 10: Hypothesis Testing

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45

Transcription:

Topic 19 Extensions on the Likelihood Ratio Two-Sided Tests 1 / 12

Outline Overview Normal Observations Power Analysis 2 / 12

Overview The likelihood ratio test is a popular choice for composite hypothesis when Θ 0 is a subspace Θ the parameter space. The rationale for this approach is that the null hypothesis is unlikely to be true if the maximum likelihood on Θ 0 is sufficiently smaller that the likelihood maximized over Θ. Let ˆθ0 be the parameter value that maximizes the likelihood for θ Θ 0 and ˆθ be the parameter value that maximizes the likelihood for θ Θ. The likelihood ratio Λ(x) = L(ˆθ 0 x) L(ˆθ x). 3 / 12

Overview We have two optimization problems - maximize L(θ x) on the parameter space Θ and on the null hypothesis space Θ 0. The critical region for an α-level likelihood ratio test is {Λ(x) λ α }. As with any α level test, λ α is chosen so that P θ {Λ(X ) λ α } α for all θ Θ 0. NB. This ratio is the reciprocal from the version given by the Neyman-Pearson lemma. Thus, the critical region consists of those values that are below a critical value. 4 / 12

Normal Observations Consider the two-sided hypothesis H 0 : µ = µ 0 versus H 1 : µ µ 0. Here the data are n independent N(µ, σ 0 ) random variables with known variance σ0 2. The parameter space Θ is one dimensional giving the value µ for the mean. As we have seen before ˆµ = x. Θ 0 is the single point {µ 0 } and so ˆµ 0 = µ 0. L(ˆµ 0 x) = and ( Notice that ) n 1 exp 1 2πσ 2 0 2σ0 2 ( n (x i µ 0 ) 2, L(ˆµ x) = i=1 ) n 1 exp 1 2πσ 2 0 2σ0 2 ( n ) Λ(x) = exp 1 2σ0 2 ((x i µ 0 ) 2 (x i x) 2 ) = exp n 2σ 2 ( x µ 0 ) 2. i=1 0 2 ln Λ(x) = n σ0 2 ( x µ 0 ) 2 = ( ) 2 x µ0 σ 0 /. n n (x i x) 2 i=1 5 / 12

Then, critical region, {Λ(x) λ α } = Normal Observations { ( ) } x 2 µ0 σ 0 / 2 ln λ α. n Under the null hypothesis, ( X µ 0 )/(σ 0 / n) is a standard normal random variable, and thus 2 ln Λ(X ) is the square of a single standard normal. This is the defining property of a χ-square random variable with 1 degree of freedom. Naturally we can use both ( ) x 2 µ0 σ 0 / and n x µ 0 σ 0 / n. as a test statistic. We have seen the second choice in the example of a possible invasion of a model butterfly by a mimic. 6 / 12

For the two-sided two-sample α-level likelihood ratio test for population proportions p 1 and p 2, based on the hypothesis H 0 : p 1 = p 2 versus H 1 : p 1 p 2, 1 we maximize the likelihood over the subspace Θ 0 = {(p 1, p 2 ); p 1 = p 2 } (the blue line) and over the entire parameter space, Θ = [0, 1] [0, 1], shown as the square, and then take the ratio, simplify and make appropriate approximations. p 2 0.5 0 0 0.5 1 p 1 The data are observations on n 1 Bernoulli trials, x 1,1, x 1,2,..., x 1,n1 from the first population and, independently, n 2 Bernoulli trials, x 2,1, x 2,2,..., x 2,n2 from the second. 7 / 12

The likelihood ratio test is approximately equivalent to the critical region z z α/2 where z = ˆp 1 ˆp 2 ( ) ˆp 0 (1 ˆp 0 ) 1 + 1 n1 n2 with ˆp i, the sample proportion of successes from the observations from population i and ˆp 0, the pooled proportion ˆp 0 = 1 n 1 + n 2 ((x 1,1 + + x 1,n1 ) + (x 2,1 + + x 2,n2 )) = n 1ˆp 1 + n 2ˆp 2 n 1 + n 2. 8 / 12

The subsequent winter had 167 out of 250 hives surviving. To test if the two survival probabilities are significantly different: > prop.test(c(250,167),c(332,250)) 2-sample test for equality of proportions with continuity correction data: c(250, 167) out of c(332, 250) X-squared = 4.664, df = 1, p-value = 0.0308 alternative hypothesis: two.sided 95 percent confidence interval: 0.006942351 0.163081746 sample estimates: prop 1 prop 2 0.753012 0.668000 9 / 12

Exercise. 1. Give the 95% confidence interval for the difference in proportions. Does it contain the value 0? 2. Modify the R command to find a 98% confidence interval for the difference in proportions. Does it contain the value 0? 3. Explain your answers to the first two parts - why does one contain 0 and the other not? Could you have guessed this in advance by looking at the p-value. 4. Assume that the second winter was more severe. State a hypothesis for appropriate for this situation and modify the R commands and give the p-value. Could you have given this p-value based on the information in the output on the previous slide? 10 / 12

Power analyses can be executed in R using the power.prop.test command. If we want to be able to detect a difference between two proportions p 1 = 0.7 and p 2 = 0.6 in a one-sided test with a significance level of α = 0.05 and power 1 β = 0.8. > power.prop.test(p1=0.70,p2=0.6,sig.level=0.05,power=0.8, alternative = c("one.sided")) Two-sample comparison of proportions power calculation n = 280.2581 p1 = 0.7 p2 = 0.6 sig.level = 0.05 power = 0.8 alternative = one.sided NOTE: n is number in *each* group We will need a sample of n = 281 from each group. 11 / 12

If we vary p 2 and determine the power. > power.prop.test(n=250,p1=0.70,p2=c(0.6,0.65),sig.level=0.05, alternative = c("one.sided")) p2 = 0.60, 0.65 power = 0.7589896, 0.3256442 Now, let s vary sample size. > power.prop.test(n=c(250,350,450,550),p1=0.70,p2=0.60,sig.level=0.05, alternative = c("one.sided")) n = 250, 350, 450, 550 power = 0.7589896, 0.8717915, 0.9342626, 0.9672670 Exercise. Determine the reduction in power when the significance level α = 0.02 for the sample sizes above. Why is the power reduced? 12 / 12