( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

Similar documents
Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Last Lecture. Wald Test

5. Likelihood Ratio Tests

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Lecture Notes 15 Hypothesis Testing (Chapter 10)

Stat 319 Theory of Statistics (2) Exercises

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Common Large/Small Sample Tests 1/55

Topic 9: Sampling Distributions of Estimators

STATISTICAL INFERENCE

Lecture 7: Properties of Random Samples

x = Pr ( X (n) βx ) =

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Summary. Recap ... Last Lecture. Summary. Theorem

Topic 9: Sampling Distributions of Estimators

Problem Set 4 Due Oct, 12

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Topic 18: Composite Hypotheses

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Expectation and Variance of a random variable

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Topic 9: Sampling Distributions of Estimators

Chi-Squared Tests Math 6070, Spring 2006

Properties and Hypothesis Testing

1.010 Uncertainty in Engineering Fall 2008

Statistical Theory MT 2008 Problems 1: Solution sketches

IIT JAM Mathematical Statistics (MS) 2006 SECTION A

Statistical Theory MT 2009 Problems 1: Solution sketches

Final Examination Statistics 200C. T. Ferguson June 10, 2010

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Introductory statistics

Asymptotics. Hypothesis Testing UMP. Asymptotic Tests and p-values

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Chapter 6 Principles of Data Reduction

Lecture 33: Bootstrap

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

Power and Type II Error

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

tests 17.1 Simple versus compound

STAC51: Categorical data Analysis

Statistics 3858 : Likelihood Ratio for Multinomial Models

Lecture 11 and 12: Basic estimation theory

6 Sample Size Calculations

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version)

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION

Matrix Representation of Data in Experiment

Lecture 19: Convergence

Review Questions, Chapters 8, 9. f(y) = 0, elsewhere. F (y) = f Y(1) = n ( e y/θ) n 1 1 θ e y/θ = n θ e yn

Data Analysis and Statistical Methods Statistics 651

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Stat 421-SP2012 Interval Estimation Section

Efficient GMM LECTURE 12 GMM II

Last Lecture. Unbiased Test

University of California, Los Angeles Department of Statistics. Hypothesis testing

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Statistical inference: example 1. Inferential Statistics


TAMS24: Notations and Formulas

Lecture 2: Monte Carlo Simulation

Bayesian Methods: Introduction to Multi-parameter Models

Chapter 4. Fourier Series

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Composite Hypotheses

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

TEACHER CERTIFICATION STUDY GUIDE

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

6.3 Testing Series With Positive Terms

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Stat410 Probability and Statistics II (F16)

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Exercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).

Statistical Inference Based on Extremum Estimators

of the matrix is =-85, so it is not positive definite. Thus, the first

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

LESSON 20: HYPOTHESIS TESTING

STAT431 Review. X = n. n )

A statistical method to determine sample size to estimate characteristic value of soil parameters

INFINITE SEQUENCES AND SERIES

Lecture 18: Sampling distributions

Unbiased Estimation. February 7-12, 2008

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Rank tests and regression rank scores tests in measurement error models

Chi-squared tests Math 6070, Spring 2014

1036: Probability & Statistics

Homework for 2/3. 1. Determine the values of the following quantities: a. t 0.1,15 b. t 0.05,15 c. t 0.1,25 d. t 0.05,40 e. t 0.

1+x 1 + α+x. x = 2(α x2 ) 1+x

Theorem. Assume the following (Cramér) conditions 1. θ θ

Transcription:

82 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Defiitio: et X be a radom sample with joit p.m/d.f. f X x θ. The geeralised likelihood ratio test g.l.r.t. of the NH : θ H 0 agaist the alterative AH : θ H 1, give the observed value x, rejects the NH at α -level of sigificace wheever θ Λ x sup θ H 0 f X x θ sup θ Θ f X x θ < c where c is such that Thus the g.l.r.t. has rejectio regio R sup Pr Λ X < c α θ H 0 x : Λ x sup θ H 0 f X x θ sup θ Θ f X x θ θ < c Example: Cosider the last example agai i which we wish to test whether productio i two plats is uiform i.e. test NH : θ 1 θ 2 agaist AH : θ 1 θ 2 where θ i is the parameter of the expoetial distributio of the compoet failure times produced i plat i, i 1, 2. Fid the g.l.r.t. procedure i this case. Solutio: We have see that the likelihood is θ f X x θ θ 1e θ 1T θ m 2 e θ 2S where T z i ad S m y i, ad the log-likelihood is l θ log θ 1 θ 1 T + log θ 2 θ 2 S Hece l θ θ 1 0 l θ θ2 0 θ 1 T 0 m θ 2 S 0 θ1 T θ 2 m S

4.4. RESTRICTED M..E AND THE G..R.T. 83 i.e. θ /T m/s ad e m m T T e m S S T S m m T S m e +m We have already see that whe the NH is true the restricted m.l.e. s are θ 1 θ 2 +m ad hece T+S m + m θ e +m T+S + m T e +m T+S S T + S T + S +m + m e +m T + S The g.l.r.t. statistic is therefore Λ x θ +m + m e +m T + S m m T S m e +m ad the rejectio regio of the g.l.r.t. is T S m costat T + S +m R {x :Λ x < c} { } T S m x :costat T + S +m < c { } T S m x : +m < c T + S { T/S } x : +m < c 1 + T/S { } U x : +m < c 1 + U where U T S z i m y i

84 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Plot of the statistic U /1+U +m U /1+U +m c 0 0 u 1 u 2 U Figure 4.8: The coditio U /1 + U +m < c is equivalet to U < u 1 ad U > u 2. From the figure above we see that the rejectio regio ca also be writte as R {x :U < u 1 or U > u 2 } { x :U m < v 1 or U m } > v 2 {x :V < v 1 or V > v 2 } where V U m T/ S/m z average of the z-values ȳ average of the y-values. To fid v 1 ad v 2 we must kow the distributio of V whe the NH is true. Sice the Z i s are expoetially distributed with parameter θ 1 the T Z i Gamma,θ 1 i.e. 2θ 1 T χ 2 2. Similarly 2θ 2 S χ 2 2m. Sice the Z i s ad Y i s are idepedet 2θ 1 T/ 2θ 2 S/m χ2 2/2 χ 2 2m/2m F 2,2m Whe the NH is true θ 1 θ 2 ad hece V T/ S/m 2θ 1T/ 2θ 2 S/m F 2,2m

4.4. RESTRICTED M..E AND THE G..R.T. 85 The values v 1 ad v 2 determiig the rejectio regio ca therefore be obtaied from the F 2,2m tables such that Pr v 1 F 2,2m v 2 1 α Example: et X be a radom sample of idepedet observatio from the Bi m,θ distributio. Fid the g.l.r.t. of NH : θ θ agaist the alterative AH : θ > θ for some fixed ad kow value θ. Solutio: The likelihood is θ;x where t x i. Cosequetly ad Thus sup θ;x θ Θ sup θ;x θ H 0 Λ x Λ t m x i θ x i 1 θ m x i θ t 1 θ m t sup θ;x θ H 0 sup θ 0,1 θ t 1 θ m t t t 1 t m t m m sup θ t 1 θ m t θ θ t t m 1 t m t t if m m θ θ t 1 θ m t if t m > θ θ;x t 1 if m θ θ t 1 θ m t sup t t θ Θ m 1 t m 1 if t mθ mθ t t m mθ m t m t if t > mθ m t if t m > θ Figure 4.9 gives the plot of Λ t from which it ca be see that it is a o-decreasig fuctio of t ad cosequetly the rejectio regio is give by R {x : Λ t < c α } {x : t > k α }

86 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION ikelihood fuctio G.l.r.t. statistic 1 likelihood θ g.l.r.t. Λt c α 0 0 t/m 1 θ 0 0 mθ t k α m Figure 4.9: The likelihood of the biomial data ad the g.l.r.t statistic where k α is such that sup Pr X R sup Pr X i > k α α level of sigificace θ θ θ θ But X i Bi m,θ. Further, sice Pr Y > k θ is icreasig i θ whe Y Bi r, θ, it follows that ad k α satisfies sup Pr θ θ m jk a+1 X i > k α Pr m j θ j 1 θ m j α X i > k α θ θ Note that there may ot exist iteger k α which satisfies the above equatio. If it does ot, we the choose k α such that m jk a+1 m j θ j 1 θ m j < α ad m jk a m j θ j 1 θ m j > α

4.5. ASYMPTOTIC FORM OF THE G..R.T. 87 i.e. k α is the smallest iteger for which the is less tha α probability that a Bi m,θ radom variable will exceed it. If is large, we ca use the Normal approximatio to the Bi m,θ distributio ad hece obtai k α as the solutio to Pr N mθ,mθ 1 θ > k α α i.e. i.e. Pr N 0, 1 > k α mθ mθ 1 θ α k α mθ mθ 1 θ z α or k α mθ + z α mθ 1 θ where z α is such that Φ z α 1 α, Φ. beig the Stadard Normal distributio fuctio. Cosequetly the rejectio regio is { } R x : x i mθ mθ 1 θ > z α 4.5 Asymptotic form of the g.l.r.t. Result: I testig the NH : θ H 0 agaist the alterative AH ; θ Θ H 0 where H 0 {θ : h 1 θ 0,h 2 θ 0,...,h r θ 0}, provided the sample size o which the test is based is large, the uder mild regularity coditios 2 log Λ X 2 log θ 2[l θ l] is approximately chi-squared distributed with r degrees of freedom whe the ull hypothesis NH is true. The critical regio of the g.l.r.t ca therefore be take as R { } x : 2 log Λ x χ 2 r,α where χ 2 r,α is take from the chi-squared tables ad is such that if W χ 2 r the Pr W χ 2 r,α α ad α level of sigificace of test. The degrees

88 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION of freedom are equal to the umber of idepedet side coditios used to specify the ull hypothesis. Note that 2 log Λ x χ 2 r idepedetly of θ as log as θ H 0. I the above Λ x is the g.l.r.t. statistic, θ is the likelihood fuctio, lθ is the lo-likelihood fuctio, θ the m.l.e. of θ ad θ the restricted m.l.e. i H 0. Sketch of proof: Sice θ Λ x sup θ θ H 0 sup θ θ Θ we have that { } 2 log Λ x 2 log log θ { } 2 l l θ where θ is the m.l.e of θ ad θ is the restricted m.l.e. However, we have see that whe bmθ ad θ emerge as turig poits of the likelihood θ ad hece of the log-likelihood l θ ad whe the ull hypothesis is true, θ ad θ are close to each other with high probability. Hece expadig l θ about θ we get to a secod order approximatio k 2 log Λ x 2 l l θi θ l i θ i 1 k k θi 2 θ i θj θ 2 l j θ j1 i θ j k k θi θ i θj θ 2 l j 4.19 θ i θ j j1 By argumets similar to those that produced 4.8 we have by the law of large umbers 2 l θ 2 l θ E I ij θ as θ i θ j θ i θ j

4.5. ASYMPTOTIC FORM OF THE G..R.T. 89 with I ij θ the i,jth elemet of the Fisher Iformatio matrix I θ. Hece for large 4.19 is further approximated by k k 2 log Λ x θi θ i θj θ j I ij j1 T θ θ I θ θ 4.20 Fially sice θ ad θ 0 are close to each other with high probability we ca itroduce the further approximatio T 2 log Λ x θ θ I θ0 θ θ But sice, as we have see, whe the ull hypothesis is true both θ ad θ are multivariate Normally distributed ad hece so is θ θ it follows that 2 log Λ x is approximately a quadratic i a Normally distributed radom vector of zero mea vector. It ca be show that such a quadratic is chisquared distributed. This is a extesio of the result that if W N 0,σ 2 the W 2 /σ 2 χ 2 1 Example et X ij, j 1, 2,..., i be the failure times of a radom sample of i electroic compoet produced selected from the productio lie of the ith maufacturer i 1, 2, 3 ad assume that they are idepedetly ad expoetially distributed with meas which may differ from maufacturer to maufacturer. Assumig that 1, 2 ad 3 are large, costruct a approximate test to test the ull hypothesis that the mea times to failure are the same for the three compaies. If 1 2 3 20 ad X 1j 3106, X 2j 5620 X 3j 3912 j j carry out the test ad report your coclusios. Solutio: {X ij } i j1 is a radom sample from the expoetial distributio with parameter θ i, mea 1/θ i i 1, 2, 3 ad we wish to test the hypothesis NH : θ 1 θ 2 θ 3 agaist AH : θ 1 θ 2 θ 3 Here θ θ 1,θ 2,θ 3 ad the likelihood of the observed values is θ 1 j1 2 3 θ 1 e θx 1j θ 2 e θx 2j θ 3 e θx 3j j1 j j1 θ 1 1 e θ 1t 1 θ 2 2 e θ 2t 2 θ 3 3 e θ 3t 3

90 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION where t 1 1 j1 x 1j, t 2 2 j1 x 2j t 3 3 j1 x 3j. The log-likelihood is Hece l θ 1 log θ 1 + 2 log θ 2 + 3 log θ 3 θ 1 t 1 θ 2 t 2 θ 3 t 3 l θ θ i 0 i θ i t i θ i i t i i 1, 2, 3 Whe NH is true i.e. whe θ 1 θ 2 θ 3 θ say the likelihood fuctio reduces to θ θ 1+ 2 + 3 e θt 1+t 2 +t 3 ad Hece i.e. Cosequetly l θ log θ 1 + 2 + 3 log θ θ t 1 + t 2 + t 3 l θ θ θ 0 1 + 2 + 3 θ t 1 + t 2 + t 3 0 θ 1 + 2 + 3 t 1 + t 2 + t 3 θ 1 θ 2 θ 3 1 + 1 + 2 + 2 + 3 3 e 1+ 2 + 3 t 1 + t 2 + t 3 ad 1 t 1 1 e 1 2 t 2 2 e 2 1 1 2 2 3 3 e 1+ 2 + 3 t 1 1 t 2 2 t 3 3 3 t 3 3 e 3 Hece g.l.r.t. statistic is θ Λ x 1 + 2 + 3 1+ 2 + 3 t 1 1 t 2 2 t 3 3 1 1 2 2 3 3 t 1 + t 2 + t 3 1+ 2 + 3 1 2 3 t1 t2 t3 t 1 2 3

4.5. ASYMPTOTIC FORM OF THE G..R.T. 91 where 1 + 2 + 3 ad t t 1 + t 2 + t 3. The asymptotic form of the g.l.r.t. statistic is [ ] 2 log Λ x 2 l l θ [ ] t t1 t2 t3 2 log 1 log 2 log 3 log 1 2 3 χ 2 2 whe the ull hypothesis is true Note that NH is specified i terms of two coditios sice θ 1 θ 2 θ 3 { θ1 θ 2 0 θ 1 θ 3 0 distributio. For the data give { 3106 + 5620 + 3912 3106 2 log Λ 2 60 log 20 log 60 20 } 5620 3912 20 log 20 log 20 20 3.6209 Sice 3.6209 < χ 2 2;0.05 5.99 there is o evidece, at the 5% level, to reject the ull hypothesis. A importat example: Observatios fall idepedetly i oe of four categories C 1,C 2,c 3 ad C 4 with respective probabilities θ 1,θ 2,θ 3,θ 4 with θ 1 + θ 2 + θ 3 + θ 4 1. the followig hypothesis is put forward NH : θ 1 β 2, θ 2 1 2 β1 β, θ 3 3 2 β1 β, θ 4 1 β 2 with β 0.6 I a radom sample of 100 such observatios the umbers fallig i the four categories were x 1 10, x 2 13, x 3 37 ad x 4 40 Perform a test of approximate 5% level of sigificace to test this hypothesis agaist the alterative that at least oe of the equalities i NH does ot hold; show that there is evidece to reject NH.

92 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Solutio: Note that NH really says θ 1 0.36, θ 2 0.12, θ 3 0.36, θ 4 0.16 Note also that sice the θ i s add up to 1, the last oe is determied as soo as we kow the first three θ i s; cosequetly NH ivolves oly 3 idepedet equatios/coditios. The likelihood of θ θ 1,θ 2,θ 3,θ 4 T for the result x i observatios i the ith category i 1, 2, 3, 4 whe are sampled is give by the multiomial probability where x 1 + x 2 + x 3 + x 4 ad! θ x 1!x 2!x 3!x 4! θx 1 1 θ x 2 2 θ x 3 3 θ x 4 4 θ x 1 1 θ x 2 2 θ x 3 3 1 θ 1 θ 2 θ 3 x 4 lθ x 1 log θ 1 + x 2 log θ 2 + x 3 log θ 3 + x 4 log θ 1 θ 2 θ 3 + cost] Hece differetiatig w.r.t. θ i i 1, 2, 3 ad equatig to zero to obtai the m.l.e. we get i.e. lθ θ i 0 x i x 4 θ i 1 θ 1 θ 2 θ i 1, 2, 3 3 θ i x i x 4 θ 4 i 1, 2, 3, 4. 4.21 Sice 4 θ i 1 addig the four equatios above gives 1 x 1 + x 2 + x 3 + x 4 θ4 θ4 x 4 x θ 4 x 4 4 Replacig this i 4.21 we get the m.l.e. θ i x i i 1, 2, 3, 4 ad hece the maximised log-likelihood fuctio l x i log θ i + cost x i log xi + cost

4.5. ASYMPTOTIC FORM OF THE G..R.T. 93 Whe NH is true H 0 is a oe poit set, amely H 0 {θ 0.36, 0.12, 0.36, 0.16}. Hece θ 0.36, 0.12, 0.36, 0.16 π 1,π 2, π 3,π 5 ad the maximized log-likelihood uder the NH is l θ x i log π i + cost Therefore [ 2 log Λx 2[l l θ] 2 x i log Note that 2 x i log xi xi π i ] x i log π i χ 2 3 π i expected umber of observatios out of to fall i the ith category whe NH is true e i x i observed umber of observatios out of that fall i the ith category o i Thus the test statistic has the form 2 log Λx 2 o i log oi e i. whe NH is true For the give results 2 log Λx 2 10 log 10 13 37 40 + 13 log + 37 log + 40 log 25.90 36 12 36 16 > χ 2 3,0.05 7.815 There is, therefore, evidece at the 5% level of sigificace agaist the ull hypothesis which is therefore rejected.

94 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION b We ow chage the ull hypothesis to NH : θ 1 β 2, θ 2 1 2 β1 β, θ 3 3 2 β1 β, θ 4 1 β 2 with β uspecified i 0,1. Notice that this ull hypothesis is specified i terms of oly two idepedet equatios/coditios. As before the equatio for theta 4 is redudat; further, sice β θ 1, the oly idepedet equatios are θ 2 1 2 θ1 1 θ 1, θ 3 3 2 θ1 1 θ 1 Thus whe the ull hypothesis NH is true the g.l.r.t. statistic 2 log Λx χ 2 2. Now as i part a whe NH is ot true l x i log xi + cost Whe NH is true the likelihood is θ β! x 1!x 2!x 3!x 4! β2x 1 1 β1 β 2 β 2x 1+x 2 +x 3 1 β x 2+x 3 +2x 4 β N 1 β M x2 3 x3 β1 β 1 β 2x 4 2 where N 2x 1 + x 2 + x 3 ad M x 2 + x 3 + 2x 4. Thus whe NH is true the log-likelihood is ad i.e. the m.l.e. β of β satisfies lβ N log β + M log1 β + costat lβ β 0 N β M 1 β 0 N1 β M β 0 β N N + M N 2 For the give data β N 2 70 200 0.35.

4.5. ASYMPTOTIC FORM OF THE G..R.T. 95 The restricted m.l.e. s of the θ i s whe NH is true are θ 1 β 2 π 1 β, θ2 1 2 β1 β π 2 β, θ 3 3 2 β1 β π 3 β, θ4 1 β 2 π 4 β Thus l θ x i log θ i + cost x i log π i β + cost so that 2 log Λx 2[l l θ] x i log xi xi x i log π i β x i log π β Note agai that π i β estimated expected umber fallig i the ith category e i Thus the g.l.r.t. statistic still has the form For the give data 2 log Λx 2 o i log oi e 1 100 0.35 2 12.25, e 2 100 1 0.35 0.65 11.375, 2 e i e 3 100 3 2 0.35 0.65 34.125, e 4 100 0.65 2 42.25 ad [ 2 log Λx 2 10 log 10 ] 12.25 + 13 log 13 11.375 + 37 log 37 40 + 40 log 34.125 42.25 1.02 < χ 2 2,0.05 5.991

96 CHAPTER 4. MAXIMUM IKEIHOOD ESTIMATION Thus there is o evidece at the 5% level of sigificace to reject the ull hypothesis. This example ca be geeralized. Result: The multiomial test Suppose each observatio ca fall i oe of k categories C 1,C 2,...,C k with PrC i θ i, i 1, 2,...,k, k θ i 1, ad that observatios are take idepedetly with x i fallig i C i, i 1, 2,...,k, k x i. Thus the sample joit p.m.f. is f X x θ! x 1!x 2!...x k! θx i 1 θ x2 θ x k k We formulate the ull hypothesis that states that the θ i s follow the model NH : θ i π i β i 1, 2,...,k with the π i s give fuctios ivolvig a ukow s-dimesioal parameter β if o β parameter is ivolved i the model the s 0 ad the π is i NH are give values. For large, the asymptotic form of the g.l.r.t. statistic is 2 log Λx k χ 2 k 1 s xi x i log π i β k o i log if s 0 the π i β π i, the give umerical value i NH where β is the m.l.e. of β uder the ull hypothesis i.e β maximises! Q k x i! k [π iβ] x i. oi e i