Chapter 13: Tests of Hypothesis

Section 13.1 Introduction

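RECAP: Chapter 1 discussed the Likelihood Ratio (LR) method as a general approach to finding good test procedures. Testing for the normal mean was the example discussed in Section 1.6.

Example: A sample of size $n$ from a normal population $N(\mu, \sigma^2)$, with $\sigma$ known. We test

$$H_0: \mu = \mu_0 \quad \text{against} \quad H_1: \mu \neq \mu_0.$$

The LR test leads to the critical region $|\bar{X} - \mu_0| \ge c$, for some constant $c$. Under $H_0$, $\bar{X} \sim N(\mu_0, \sigma^2/n)$, so

$$P\left(|\bar{X} - \mu_0| \ge c\right) = P\left(|Z| \ge \frac{c}{\sigma/\sqrt{n}}\right), \qquad Z = \frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}.$$

If $P(\text{Type I error}) = \alpha$, then $c = z_{\alpha/2}\,\sigma/\sqrt{n}$ and the rejection region is

$$\left|\frac{\bar{X} - \mu_0}{\sigma/\sqrt{n}}\right| \ge z_{\alpha/2},$$

i.e., reject $H_0: \mu = \mu_0$ if $\bar{X} \le \mu_0 - z_{\alpha/2}\,\sigma/\sqrt{n}$ or $\bar{X} \ge \mu_0 + z_{\alpha/2}\,\sigma/\sqrt{n}$; i.e., if $\mu_0$ lies outside $\left(\bar{X} - z_{\alpha/2}\,\sigma/\sqrt{n},\; \bar{X} + z_{\alpha/2}\,\sigma/\sqrt{n}\right)$, the $100(1-\alpha)\%$ confidence interval (CI) for $\mu$.

The alternative hypothesis $H_1: \mu \neq \mu_0$ is equivalent to $\mu < \mu_0$ or $\mu > \mu_0$. This is a two-sided alternative: the true mean could be less than (or more than) the mean under the null. Therefore, the critical region is two-tailed. Reject the null if the sample mean is far from the population mean under the null hypothesis, in either direction.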
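The two-tailed rule and its equivalence to the confidence interval can be checked numerically. The following is a minimal sketch using only the Python standard library; the function name `two_sided_z_test` and the sample values (sample mean 52, hypothesized mean 50, known sigma 5, n = 25) are illustrative assumptions, not taken from the text.

```python
from math import sqrt
from statistics import NormalDist


def two_sided_z_test(xbar, mu0, sigma, n, alpha=0.05):
    """Two-sided z-test of H0: mu = mu0 when sigma is known.

    Returns the observed z, the 100(1 - alpha)% CI for mu, and the decision.
    """
    se = sigma / sqrt(n)                          # standard error of the sample mean
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # upper-tail critical value z_{alpha/2}
    z = (xbar - mu0) / se                         # standardized test statistic
    ci = (xbar - z_crit * se, xbar + z_crit * se)
    reject = abs(z) >= z_crit                     # two-tailed critical region
    return z, ci, reject


# Hypothetical data, chosen only for illustration.
z, ci, reject = two_sided_z_test(xbar=52.0, mu0=50.0, sigma=5.0, n=25)
mu0_inside_ci = ci[0] < 50.0 < ci[1]
print(f"z = {z:.3f}, CI = ({ci[0]:.3f}, {ci[1]:.3f}), reject H0: {reject}")
```

In this sketch the null is rejected exactly when the hypothesized mean falls outside the interval, which mirrors the equivalence stated above.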

How far away is too far away is determined by the desired $P(\text{Type I error}) = \alpha$, also called the size of the test procedure.

What if the alternative hypothesis is one-sided, that is, $H_1: \mu > \mu_0$ or $H_1: \mu < \mu_0$? Then, as discussed in Sec. 1.5, the most powerful (MP) critical region is one-tailed:

Reject $H_0: \mu = \mu_0$ in favor of $H_1: \mu > \mu_0$ if $\bar{X} \ge \mu_0 + z_{\alpha}\,\sigma/\sqrt{n}$, or
reject $H_0: \mu = \mu_0$ in favor of $H_1: \mu < \mu_0$ if $\bar{X} \le \mu_0 - z_{\alpha}\,\sigma/\sqrt{n}$.

Thus the test procedure rejects the null in favor of the alternative if the sample mean falls where one would expect it to fall under the alternative.

Four steps in the traditional testing-of-hypotheses set-up:
(i) Formulate $H_0$ and $H_1$, and specify the value of $\alpha$.
(ii) Define the appropriate test statistic and its sampling distribution, and determine the appropriate critical region of size $\alpha$.
(iii) Collect the sample data and calculate the value of the test statistic.
(iv) Check whether this value falls in the critical region and, accordingly, reject the null, do not reject the null (accept), or reserve judgment.

In the above two-sided or one-sided tests for the normal mean, the test statistic $\bar{X}$ follows a normal distribution with mean $\mu$ and variance $\sigma^2/n$. Under the null hypothesis, its mean is $\mu_0$. After standardizing $\bar{X}$ to $Z$, the critical values (cut-off points for the critical region) are $z_{\alpha/2}$ or $z_{\alpha}$, respectively, obtained from the normal cdf tables.
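The four steps can be traced in code for the one-sided case. This is a sketch under stated assumptions: the helper `one_sided_z_test`, its `alternative` labels, and the numbers used are hypothetical, and the critical value comes from `statistics.NormalDist` rather than a printed table.

```python
from math import sqrt
from statistics import NormalDist


def one_sided_z_test(xbar, mu0, sigma, n, alpha, alternative):
    """One-tailed z-test of H0: mu = mu0 with known sigma.

    alternative is "greater" (H1: mu > mu0) or "less" (H1: mu < mu0).
    Returns the cut-off for the sample mean and the reject decision.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha)  # step (ii): critical value z_alpha
    se = sigma / sqrt(n)
    if alternative == "greater":
        cutoff = mu0 + z_alpha * se
        return cutoff, xbar >= cutoff          # step (iv): upper-tail region
    if alternative == "less":
        cutoff = mu0 - z_alpha * se
        return cutoff, xbar <= cutoff          # step (iv): lower-tail region
    raise ValueError("alternative must be 'greater' or 'less'")


# Step (i): H0: mu = 50, alpha = 0.05. Step (iii): hypothetical data summary.
cutoff_greater, reject_greater = one_sided_z_test(52.0, 50.0, 5.0, 25, 0.05, "greater")
cutoff_less, reject_less = one_sided_z_test(52.0, 50.0, 5.0, 25, 0.05, "less")
print(cutoff_greater, reject_greater)
print(cutoff_less, reject_less)
```

With these illustrative numbers the sample mean lies above the upper cut-off, so the null is rejected against the "greater" alternative but not against the "less" alternative.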

In the past, tables of areas under various sampling distributions could be calculated numerically for just a few values. So values of $\alpha$ such as 0.10 (one in 10), 0.05 (one in twenty), and 0.01 (1 in 100) became the de facto standard. But these values are not sacrosanct. In real applications, one may want to choose any value of $\alpha$ or $\beta$, depending on the risk to be covered or the relative consequences of the two types of errors.

Fortunately, using numerical analysis, algorithms, and high-speed computing devices, one can now calculate (or simulate) the area under any sampling distribution in any specified region. The current practice involves computing the area under the curve beyond the observed value of the test statistic. In the case of the test for a normal mean, we calculate the area in the tail of the standard normal curve. Let $z = (\bar{x} - \mu_0)/(\sigma/\sqrt{n})$ denote the observed value of the standardized test statistic $Z$. The area under the appropriate tail beyond the observed value (shaded region) is called the P-value, prob-value, tail probability, or the observed level of significance. Computed under $H_0$, this is simply

(i) $P(\bar{X} \ge \bar{x}) = P(Z \ge z)$ for $H_1: \mu > \mu_0$;
(ii) $P(\bar{X} \le \bar{x}) = P(Z \le z)$ for $H_1: \mu < \mu_0$;
(iii) $P(|\bar{X} - \mu_0| \ge |\bar{x} - \mu_0|) = P(|Z| \ge |z|)$ for $H_1: \mu \neq \mu_0$.

Check whether the P-value is less than or equal to the stated level of significance $\alpha$ and, accordingly, reject the null hypothesis, accept it, or reserve judgment.
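The three tail areas are straightforward to compute numerically. The sketch below assumes a known sigma and uses the standard library's `NormalDist`; the function name `z_test_p_value` and the input values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_test_p_value(xbar, mu0, sigma, n, alternative):
    """P-value of the z-test for a normal mean with known sigma.

    alternative is "greater", "less", or "two-sided".
    """
    z = (xbar - mu0) / (sigma / sqrt(n))  # observed standardized statistic
    std_normal = NormalDist()
    if alternative == "greater":
        return 1 - std_normal.cdf(z)              # area in the upper tail beyond z
    if alternative == "less":
        return std_normal.cdf(z)                  # area in the lower tail below z
    if alternative == "two-sided":
        return 2 * (1 - std_normal.cdf(abs(z)))   # both tails beyond |z|
    raise ValueError("unknown alternative")


# Hypothetical summary statistics giving z = 2.
p_greater = z_test_p_value(52.0, 50.0, 5.0, 25, "greater")
p_less = z_test_p_value(52.0, 50.0, 5.0, 25, "less")
p_two_sided = z_test_p_value(52.0, 50.0, 5.0, 25, "two-sided")
alpha = 0.05
print(p_greater, p_less, p_two_sided, p_two_sided <= alpha)
```

Comparing each P-value with the chosen alpha gives the same decision as checking the corresponding critical region, without needing a table for that particular alpha.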

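In exploratory data analysis, the P-value is also interpreted as the strength of evidence against the null hypothesis: the smaller the P-value, the stronger the evidence.

Section 13.2 Tests concerning Population Means

If the population can be assumed to be normal, as discussed in Section 13.1, the test for the mean is based on the Z-statistic. If the population is not normal but has a finite variance, and the sample size is large enough that the distribution of $\bar{X}$ can be approximated by a normal distribution (using the CLT), we can again use the test based on the Z-statistic.

For $n < 30$ and $\sigma$ unknown, using the result of Exercise 1. (the Likelihood Ratio Test for samples from a normal population), the one-sample t-test based on

$$t = \frac{\bar{x} - \mu_0}{s/\sqrt{n}},$$

with $n - 1$ degrees of freedom, is used. It is a one-tailed or two-tailed test depending on whether the alternative hypothesis is one-sided or two-sided. For $n \ge 30$, the t-test uses critical values from the standard normal distribution.

Section 13.3 Tests concerning Difference of Two Means

Examples: a new medicine is as good as the old one; two brands of tires have the same mean tread life; the average lifetimes of two brands of bulbs differ by a specified number of hours; etc.

Suppose we have independent random samples of sizes $n_1$ and $n_2$ from two normal populations with means $\mu_1$ and $\mu_2$ and known variances $\sigma_1^2$ and $\sigma_2^2$, and want to test the null hypothesis $H_0: \mu_1 - \mu_2 = \delta$ against the alternatives $H_1: \mu_1 - \mu_2 > \delta$; or $\mu_1 - \mu_2 < \delta$; or $\mu_1 - \mu_2 \neq \delta$ (with $\delta = 0$ when testing equality of the means). Using the Likelihood Ratio test and the result from Ex. 8., the respective critical regions can be described as $z \ge z_{\alpha}$; or $z \le -z_{\alpha}$; or $|z| \ge z_{\alpha/2}$, where

$$z = \frac{\bar{x}_1 - \bar{x}_2 - \delta}{\sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}}.$$

If the independent samples are not from normal populations, but both sample sizes are large enough to use the CLT, one can use the above test. In this case, if the variances are not known, one can substitute the sample variances $s_1^2, s_2^2$ for $\sigma_1^2, \sigma_2^2$, respectively.

Small sample sizes and unknown variances: for independent samples from normal populations, we need to assume that the two populations have the same unknown variance. The Likelihood Ratio test yields a test based on the pooled estimate of variance, given by

$$s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}.$$

The two-sample t-test is based on the t-statistic

$$t = \frac{\bar{x}_1 - \bar{x}_2 - \delta}{s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}.$$

This expression is the value of a random variable having the t-distribution with $n_1 + n_2 - 2$ degrees of freedom. Hence the critical regions for $H_1: \mu_1 - \mu_2 > \delta$; or $\mu_1 - \mu_2 < \delta$; or $\mu_1 - \mu_2 \neq \delta$ are given by $t \ge t_{\alpha,\,n_1+n_2-2}$; or $t \le -t_{\alpha,\,n_1+n_2-2}$; or $|t| \ge t_{\alpha/2,\,n_1+n_2-2}$, respectively.

When $n_1 = n_2 = n$, the pooled variance simplifies to

$$s_p^2 = \frac{s_1^2 + s_2^2}{2},$$

and the t-statistic simplifies to

$$t = \frac{\bar{x}_1 - \bar{x}_2 - \delta}{\sqrt{(s_1^2 + s_2^2)/n}}.$$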
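The pooled-variance computation can be sketched as follows, using only the Python standard library. The function name `pooled_t_statistic`, the two small samples, and the choice of delta = 0 are illustrative assumptions. The standard library has no t-distribution, so the critical value $t_{0.025,\,8} = 2.306$ is entered by hand from a standard t-table.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t_statistic(sample1, sample2, delta=0.0):
    """Two-sample t-statistic assuming equal (unknown) population variances.

    Returns (t, degrees of freedom, pooled variance).
    """
    n1, n2 = len(sample1), len(sample2)
    s1_sq, s2_sq = variance(sample1), variance(sample2)  # sample variances (n - 1 divisor)
    df = n1 + n2 - 2
    sp_sq = ((n1 - 1) * s1_sq + (n2 - 1) * s2_sq) / df   # pooled estimate of variance
    t = (mean(sample1) - mean(sample2) - delta) / (sqrt(sp_sq) * sqrt(1 / n1 + 1 / n2))
    return t, df, sp_sq


# Hypothetical measurements, for illustration only.
sample_a = [10, 12, 14, 16, 18]
sample_b = [8, 9, 10, 11, 12]
t_stat, df, sp_sq = pooled_t_statistic(sample_a, sample_b)

# Equal sample sizes: the simplified form should give the same value.
n = len(sample_a)
t_simplified = (mean(sample_a) - mean(sample_b)) / sqrt((variance(sample_a) + variance(sample_b)) / n)

T_CRIT_0025_DF8 = 2.306  # t_{alpha/2, 8} for alpha = 0.05, taken from a t-table
reject_two_sided = abs(t_stat) >= T_CRIT_0025_DF8
print(t_stat, df, sp_sq, reject_two_sided)
```

Because both samples have the same size here, the general formula and the simplified equal-$n$ formula agree, which is a quick consistency check on the algebra above.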

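If the assumption of equal variance is not reasonable, there are other solutions available in the literature, but we will not consider them here.

Non-independent samples (for example, correlated variables from the same population):
o Paired data (before-after, siblings, twins, etc.)
o Use the mean $\bar{d}$ of the pairwise differences $d_i = x_{1,i} - x_{2,i}$, $i = 1, \ldots, n$, and test the null hypothesis for the mean of these differences: $H_0: \mu_d = \delta$ against $H_1: \mu_d > \delta$; or $\mu_d < \delta$; or $\mu_d \neq \delta$, where $\delta$ is the hypothesized mean difference (often 0).
o For small $n$, but samples from a normal population, we can use the one-sample t-test based on $t = \dfrac{\bar{d} - \delta}{s_d/\sqrt{n}}$ with $\nu = n - 1$ degrees of freedom, where $s_d$ is the sample standard deviation of the differences. The critical regions are given by $t \ge t_{\alpha,\nu}$; or $t \le -t_{\alpha,\nu}$; or $|t| \ge t_{\alpha/2,\nu}$, respectively.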
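The paired test is just a one-sample t-test applied to the differences, which the following sketch makes explicit. The function name `paired_t_statistic` and the before/after values are hypothetical, and the critical value $t_{0.025,\,4} = 2.776$ is entered by hand from a standard t-table because the Python standard library does not provide the t-distribution.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t_statistic(first, second, delta=0.0):
    """One-sample t-statistic on pairwise differences d_i = first_i - second_i.

    Returns (t, degrees of freedom).
    """
    if len(first) != len(second):
        raise ValueError("paired samples must have the same length")
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    d_bar = mean(diffs)    # mean of the differences
    s_d = stdev(diffs)     # sample standard deviation of the differences
    t = (d_bar - delta) / (s_d / sqrt(n))
    return t, n - 1


# Hypothetical before/after measurements on the same five subjects.
before = [12, 15, 11, 14, 13]
after = [10, 12, 10, 11, 12]
t_paired, nu = paired_t_statistic(before, after)

T_CRIT_0025_DF4 = 2.776  # t_{alpha/2, 4} for alpha = 0.05, taken from a t-table
reject_paired = abs(t_paired) >= T_CRIT_0025_DF4
print(t_paired, nu, reject_paired)
```

Setting `delta` to a nonzero value tests whether the mean difference equals that value instead of zero.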