LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

Similar documents
Introductory statistics

Summary. Recap ... Last Lecture. Summary. Theorem

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Asymptotics. Hypothesis Testing UMP. Asymptotic Tests and p-values

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Efficient GMM LECTURE 12 GMM II

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

STATISTICAL INFERENCE

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

Properties and Hypothesis Testing

Stat 200 -Testing Summary Page 1

Last Lecture. Wald Test

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

1.010 Uncertainty in Engineering Fall 2008

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Lecture 10 October Minimaxity and least favorable prior sequences

7.1 Convergence of sequences of random variables

Lecture 19: Convergence

A statistical method to determine sample size to estimate characteristic value of soil parameters

32 estimating the cumulative distribution function

Lecture 33: Bootstrap

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Output Analysis and Run-Length Control

Lecture Notes 15 Hypothesis Testing (Chapter 10)

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Convergence of random variables. (telegram style notes) P.J.C. Spreij

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

Problem Set 4 Due Oct, 12

Frequentist Inference

1 Convergence in Probability and the Weak Law of Large Numbers

Point Estimation: properties of estimators 1 FINITE-SAMPLE PROPERTIES. finite-sample properties (CB 7.3) large-sample properties (CB 10.

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

x = Pr ( X (n) βx ) =

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Statistical Inference Based on Extremum Estimators

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

Topic 9: Sampling Distributions of Estimators

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

Probability and Statistics

Last Lecture. Unbiased Test

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

5. Likelihood Ratio Tests

Rates of Convergence by Moduli of Continuity

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Chapter 6 Principles of Data Reduction

2.2. Central limit theorem.

1 Review and Overview

Asymptotic Results for the Linear Regression Model

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 12: September 27

LECTURE 8: ASYMPTOTICS I

Topic 9: Sampling Distributions of Estimators

Lecture 27: Optimal Estimators and Functional Delta Method

Lecture 11 and 12: Basic estimation theory

Stat 319 Theory of Statistics (2) Exercises

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

Statistical inference: example 1. Inferential Statistics

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

MA238 Assignment 4 Solutions (part a)

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Lecture 2: Monte Carlo Simulation

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Module 1 Fundamentals in statistics

Stat 421-SP2012 Interval Estimation Section

Advanced Stochastic Processes.

Output Analysis (2, Chapters 10 &11 Law)

Topic 9: Sampling Distributions of Estimators

STAT431 Review. X = n. n )

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

1 Introduction to reducing variance in Monte Carlo simulations

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Estimation for Complete Data

Chi-Squared Tests Math 6070, Spring 2006

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

7.1 Convergence of sequences of random variables

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

The standard deviation of the mean

Lecture 3: August 31

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

Lecture 11 October 27

Math 140 Introductory Statistics

Random Variables, Sampling and Estimation

STAT Homework 1 - Solutions

4. Partial Sums and the Central Limit Theorem

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Lecture 7: Properties of Random Samples

Lecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett

Transcription:

LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio 1.1 is a basic otio of correctess. I fact, most tests are cosistet. I the rest of the sectio, we refrai from presetig mathematically rigorous results because the level of the subject is such that it is difficult to state the eve the assumptios without itroducig additioal techical cocepts. Cosider testig H 0 : θ = θ 0 versus H 1 : θ θ 0 by a Wald test. That is, cosider the test ϕx) = 1 χ p,, ) w ), where w is the Wald test statistic: 1.1) w := V 1 ) ˆθ θ 0 ˆθ is a asymptotically ormal estimator of θ, ad V is a cosistet estimator of its asymptotic variace). The power is βθ) = w χ ) p, At ay θ 1 θ 0, the Wald statistic diverges. Ideed, V 1 ˆθ θ 0 ) = V 1 ) ˆθ θ 1 + V 1 ) θ1 θ 0. We recogize the first term is O P 1) ˆθ is asymptotically ormal), but the secod term diverges. Thus the power teds to oe. It is possible to show that the LR ad score tests are cosistet by similar argumets. Cosistecy esures the power of a test grows to oe as the sample size grows. However, the rate of covergece is uclear. Whe we ecoutered a similar problem whe evaluatig poit estimators, we blew up the error by ad studied the limitig distributio of ˆθ θ ). 1

STAT 01B The aalogous trick here is to study the limitig distributio of the Wald statistic uder a sequece of local alteratives: 1.) θ := θ 0 + h. Formally, cosider a triagular array x 1,1 x,1 x,..... x,1 x,... x,....., where x,i i.i.d. F θ. We remark that observatios i differet rows of the array are ot idetically distributed. Let ˆθ be a asymptotically ormal estimator of θ based o observatios {x,i } i []. The Wald statistic is V 1 ) ˆθ θ 0 = V 1 ) ˆθ θ + V 1 h. Ituitively, the first term coverges i distributio to a N 0, I p ) radom variable, ad the secod term coverges to Avar ˆθ ) 1 h. Thus 1.3) V 1 ) d ) 1 ) ˆθ θ 0 N Avar ˆθ h, I p. ad the power fuctio coverges to z ) 1 βθ ) P + Avar ˆθ h ) χ p,, where z N 0, I p ). The precedig limit of the power fuctio is called the asymptotic power of the Wald test. Evidetly, the larger V 1 h is, the higher is the asymptotic power. Thus Wald tests based o efficiet estimators are more powerful. There is a similar story for the LR ad score tests. We remark that z + µ is distributed as a o-cetral χ radom variable: µ is the o-cetrality parameter. Example 1.. test. We kow Let x i i.i.d. Berp). We wish to test H 0 : p = p 0 by a Wald 1. the MLE of p is ˆp = x,. the asymptotic variace of ˆp is p1 p).

LECTURE 14 NOTES 3 If p p 0, the power of the Wald test is approximately ) ) βp) = P ˆp p 0 ) p χ p1 p) 1, ) ) = P ˆp p) p + p p0 ) χ p1 p) p1 p) 1, ) P z + p p0 ) χ p1 p) 1, ), where z is a stadard ormal radom variable. The o-cetrality parameter is p p 0) p1 p). To illustrate use of the precedig approximate power fuctio i a cocrete settig, cosider the desig questio: how may samples are required to achieve 0.9 power agaist the alterative H 1 : p 1 = p 0 + 0.1? By the properties of the o-cetral χ 1 distributio, to esure P z + µ) χ 1,) 0.90, the o-cetrality parameter µ must be at least 10.51. Recall the o-cetrality parameter is p 1 p 0 ) p 1 1 p 1 ). We solve for to deduce > 10.51p 11 p 1 ) p 1 p 0 ) = 6.65. It is possible to rigorously justify 1.1) uder suitable coditios by appeaig to the theory of local asymptotic ormality, much of which was developed by Lucie Le Cam at Berkeley.. Iterval estimatio. I the first part of the course, we cosidered the task of poit estimatio, where the goal is to provide a sigle poit that is a guess for the value of the ukow parameter. The goal of iterval estimatio is to provide a set that cotais the ukow parameter with some prescribed probability. Defiitio.1. Let Cx) Θ be a set-valued radom variable. It is a 1 -cofidece set for a parameter θ if θ Cx)) 1. If Cx) is a iterval o R, we call Cx) a cofidece iterval. We emphasize that the set Cx) ot the parameter θ is the radom quatity i Defiitio.1. Observig [ lx), ux) ] = [l, u] should ot be iterpreted as θ [l, u] with probability at least 1 : θ is a determiistic

4 STAT 01B quatity, so it is osese to cosider the probability of θ [l, u]. A correct statemet is θ [lx), ux)] with probability at least 1. As we shall see, formig iterval estimators essetially boil dow to ivertig hypothesis tests. Lemma.. Let Aθ 0 ) X be the acceptace regio of a -level test of H 0 : θ = θ 0. The set A 1 x), where A 1 x) := {θ Θ : x Aθ)} is a 1 cofidece set for θ 0. That is, the set of parameters θ 0 at which a -level test of H 0 : θ = θ 0 accepts is a 1 -cofidece set for θ 0. Proof. By defiitio of A 1 x), the evet {x Aθ 0 )} is equivalet to {θ 0 A 1 x)}. Uder H 0, a -level test accepts with probability at least 1 : 0 x Aθ 0 )). Thus A 1 x) is a 1 cofidece set: 0 θ 0 A 1 x)) = 0 x Aθ 0 )) 1. Lemma. is essetially a tautology: A 1 x) is a 1 -cofidece set because the evet { θ A 1 x) } is equivalet to { x Aθ) }. Sice Aθ) is the acceptace regio of a -level test, 0 x Aθ 0 )) 1. Example.3. Let x N µ, 1). Cosider testig H 0 : µ = µ 0 versus H 1 : µ µ 0. We showed that the -level LRT of H 0 : µ = µ 0 rejects if x µ 0 > z. The acceptace regio Aµ 0 ) is { } x R : x µ 0 z. Thus A 1 x) = { } µ 0 R : x µ 0 z is a 1 -cofidece iterval for µ 0. = [ x z, x + z i.i.d. Example.4. Let x i N µ, σ ), where σ is ukow. Cosider testig H 0 : µ = µ 0 versus H 1 : µ µ 0. We showed that the -level LRT of H 0 : µ = µ 0 rejects if φx) > t, where φx) = x µ0 ) ŝ. ]

LECTURE 14 NOTES 5 is the t-statistic. The acceptace regio is { } x R : φx) t. Thus A 1 x) = { } µ 0 R : φx) t = [ x ŝ t, x + ŝ ] t is a 1 -cofidece iterval for µ 0. We remark that Lemma. has a coverse: it is possible to obtai a -level test from a 1 -cofidece set. Lemma.5. Let Cx) is a 1 -cofidece iterval for θ. The test ϕx) = 1 1 Cx) θ 0 ), H 0 : θ = θ 0 versus H 1 : θ θ 0. That is, the test that rejects whe Cx) does ot cotai θ 0 is a -level test. Proof. Sice Cx) is a 1 cofidece iterval for θ 0, P 0 θ0 Cx) ) 1. Thus E 0 [ 1 ϕx) ] = 1 P0 θ0 Cx) ) 1 1 ). A coveiet formalism that highlights the coectio betwee hypothesis testig ad iterval is that of pivot or pivotal quatity. Defiitio.6. A fuctio φx, θ) is a pivot for a parametric model if the its distributio uder x F θ does ot deped o θ. That is, ) φx, θ) C does ot deped o θ. We remark that a pivot is techically ot a statistic because the fuctio depeds o the ukow parameter θ. The caoical example of a pivot is the z-statistic x µ σ. It is a pivot for the ormal locatio-scale model. Give a pivot, it is possible to 1. test the hypothesis H 0 : θ = θ 0. Uder H 0, we kow the distributio of the pivot φx, θ 0 ). Thus comparig the observatio φx, θ 0 ) to the kow distributio is the basis of a test.

6 STAT 01B. form a cofidece iterval for θ. As we shall see, ivertig the pivot i its secod argumet leads to a cofidece iterval for θ. Let C φx, θ) be a set of 1 mass uder the distributio of the pivot: φx, θ) C ). If we pi the secod argumet of the pivot at θ 0 lettig φ θ0 x) = φx, θ 0 )) ad ivert the pivot i its first argumet, we obtai φ 1 θ 0 C) := { x X : φx, θ 0 ) C }, which is the acceptace regio of a -level test of H 0 : θ = θ 0. Ideed, 0 x φ 1 θ 0 C) ) = 0 φx, θ0 ) C ) = 1, If we pi the first argumet of the pivot at x lettig φ x θ) = φx, θ 0 )) ad ivert the pivot i its secod argumet, we obtai which is a 1 -cofidece set for θ. φ 1 x C) := { θ Θ : φx, θ) C }, Example.7 Example.3 cotiued). We kow x µ is a pivot for the Gaussia locatio model: if x N µ, 1), φx, µ) := x µ is a pivot. If we pi the secod argumet of the pivot at µ 0, the pre-image of [ z, z ] uder the pivot is φ 1 [ µ 0 z, z ]) { [ = x R : x µ z, z ]}, which is the acceptace regio of the -level LRT of H 0 : µ = µ 0 versus H 1 : µ µ 0. The pre-image of, z ] is φ 1 µ 0, z ]) = { x R : x µ, z ]}, which is the acceptace regio of the -level UMPU test of H 0 : µ = µ 0 versus H 1 : µ > µ 0. Fially, if we pi the first argumet of the pivot at µ 0 ad ivert the pivot i its secod argumet, we obtai φ 1 [ x z, z ]) { [ = µ R : x µ z, z ]}, which is a 1 -cofidece iterval for µ.

LECTURE 14 NOTES 7 I practice, it is usually ot possible to derive a exact pivot. However, asymptotic pivots are easier to obtai. Formally, a asymptotic pivot for a parametric model is a fuctio whose asymptotic distributio does ot deped o the parameter. The caoical example of a asymptotic pivot is: V 1 ˆθ θ ) d ) N 0, Ip. Usurprisigly, ivertig asymptotic pivots leads to asymptotically -level tests ad 1 -cofidece itervals..1. Most accurate iterval estimators. Defiitio.8. A 1 cofidece set Cx) is most accurate at θ if θ Cx)) θ C x)) for ay θ θ ad ay other 1 cofidece set C x). It is uifomly most accurate UMA) o Θ if it is most accurate at ay θ Θ. Ituitively, a most accurate 1 cofidece set is least likely amog 1 cofidece sets) to iclude icorrect parameters. A straightforward calculatio shows that a most accurate 1 cofidece set has the smallest expected Lesbegue) volume amog 1 cofidece sets. Ideed, ) [ [ ]] E θ vol Cx) = 1 Cx) θ )dθ f θ x)dx X Θ ) = 1 Cx) θ )f θ x)dx dθ Θ X = θ Cx) ) dθ, which is miimized whe the itegrad is miimized o Θ. Θ Theorem.9. A 1 -UMA cofidece set o Θ is the iverse of - level UMP tests of H 0 : θ = θ versus H 1 : θ Θ \ {θ }. Proof. Let Aθ ) be the acceptace area of a UMP test of H 0 : θ = θ versus H 1 : θ Θ \ {θ }. Sice the test is UMP, x X \ Aθ ) ) x X \ A θ ) ) for ay θ Θ \ {θ } ad ay other the acceptace area of ay other -level test A θ ). Equivaletly, x Aθ ) ) x A θ ) ).

8 STAT 01B We ivert the acceptace area to obtai θ A 1 x) ) θ A 1 x) ). We observe that to obtai a most accurate cofidece set at a poit, it is ecessary to ivert a family of most powerful tests. Ideed, by Theorem.9, the most accurate cofidece set at θ is the iverse of -level most powerful tests of H 0 : θ = θ versus H 1 : θ = θ. I hypothesis testig, there is ofte o UMP tests for a prescribed pair of ull ad alterative hypotheses. Similarly, i iterval estimatio, there is usually o UMA cofidece set. To make the otio of most powerful test tractable, we restricted our attetio to ubiased tests ad studied UMPU tests. There is a similar otio of ubiasedess for cofidece sets ad ivertig ubiased tests lead to ubiased cofidece sets. Defiitio.10. A 1 cofidece set is ubiased if θ Cx) ) 1 for ay θ θ. It is a uiformly most accurate ubiased UMAU) cofidece set o Θ if it is uiformly most accurate o Θ amog ubiased cofidece sets. Lemma.11. A 1 -ubiased cofidece set o Θ is the iverse of -level ubiased tests of H 0 : θ = θ versus H 1 : θ Θ \ {θ }. Proof. The lemma is almost a tautology. Let Aθ ) be the acceptace regio of a ubiased test of H 0 : θ = θ versus H 1 : θ Θ \ {θ }: x A 1 θ ) ) 1. We ivert the acceptace regio to obtai the stated coclusio. We combie Lemma.11 ad Theorem.9 to obtai the usurprisig) result that the iverse of the acceptace regio of a UMPU test is a UMAU cofidece set. The proof is very similar to the proof of Theorem.9, ad we skip the details here. Theorem.1. A 1 -UMAU cofidece set o Θ is the iverse of -level UMPU tests of H 0 : θ = θ versus H 1 : θ Θ \ {θ }. Yuekai Su Berkeley, Califoria December, 015