4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

Size: px
Start display at page:

Download "4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49"

Transcription

1 4 HYPOTHESIS TESTING 49 4 Hypothesis testing In sections 2 and 3 we considered the problem of estimating a single parameter of interest, θ. In this section we consider the related problem of testing whether or not θ equals a particular value of interest, or lies in a particular range of values of interest. Estimation and hypothesis testing can be thought of as two related (dual) aspects of the inference problem, as we shall see later. 4.1 Types of hypothesis and types of error Suppose X 1,X 2,...,X n are an independent random sample from a probability density function f X (x θ). Instead of estimating θ, we now wish to use the sample to test hypotheses about θ. Definition 4.1.1: Simple and composite hypotheses We define a hypothesis to be an assertion or conjecture about θ. If the hypothesis completely specifies the distribution of X, it is called a simple hypothesis. Otherwise it is called a composite hypothesis.

2 4 HYPOTHESIS TESTING 50 Example Suppose we take an independent random sample X 1,X 2,...,X n from a random variable X N(µ,σ 2 ). Conisder the following hypotheses. Which are simple and which are composite? (i) H 1 : µ = 100,σ = 15; (ii) H 2 : µ > 100,σ = 15; (iii) H 3 : µ > 100,σ = µ/10; (iv) H 4 : µ = 100; (v) H 5 : σ = 15; (vi) H 6 : µ < 100. Comparing two hypotheses Usually in hypothesis testing we compare two hypotheses, the first, called the null hypothesis is H 0 : θ ω and the second, the alternative hypothesis is H 1 : θ ω where ω S, ω ω = S, ω ω = and S is the set of all possible values for the parameter θ of the distribution of the random variable X.

3 4 HYPOTHESIS TESTING 51 Example We are interested in whether a new method of sealing light bulbs increases the average lifetime of the bulbs. Here, if θ is the mean lifetime of the bulbs sealed by the new method, and we know the mean lifetime of standard bulbs is 140 hours, our hypothesis test will be a test of H 0 : θ = 140 versus H 1 : θ > 140. Now suppose we assume that the lifetime X of a new bulb follows an Exponential distribution, i.e. X Exp(1/θ). Which of H 0 and H 1 is simple and which is composite? What are the sets S, ω and ω which define this hypothesis test? Definition 4.1.2: Acceptance region and rejection region Let A be the sample space of X, i.e. the set of all possible values of a random sample of size n from X. A test procedure divides A into subsets A 0 and A 1 (with A 0 A1 = A, A 0 A1 = ) such that if and if X A 0, we accept H 0 X A 1, we reject H 0 and accept H 1. A 0 is called the acceptance region and A 1 the rejection region of the test.

4 4 HYPOTHESIS TESTING 52 Definition 4.1.3: Type I error and type II error When performing a test we may make the correct decision, or one of two possible errors: (i) Type I error: reject H 0 when it is true; (ii) Type II error: accept H 0 when it is false. The type I error is usually regarded as the more serious mistake. The probabilities of making type I and type II errors are usually denoted by α(θ) and β(θ) respectively. Example Now returning to the lightbulbs sealed by the new method in Example 4.1.2, suppose that once again we wish to test: H 0 : θ = 140 versus H 1 : θ > 140, and we collect some data consisting of ten measurements of lifetimes x 1,...,x 10. Suppose we choose to accept H 0 if the sample mean x satisfies x < 150, and to reject H 0 (and hence accept H 1 ) if x 150. What are the sample space, the acceptance region and the rejection region for this test? What are the Type I and Type II errors in this specific case?

5 4 HYPOTHESIS TESTING 53 In Sections 4.2 to 4.6 we will develop the ideas of hypothesis testing by studying the main important cases. 4.2 Inference for a single Normal sample For this section we will assume that X 1,X 2,...,X n is an i.i.d. random sample from a N(µ,σ 2 ) distribution. For the time being, we assume σ 2 is known, i.e. a constant. Moreover, a particular value µ = µ 0 for the population mean has been suggested by previous work or ideas. In this case the null hypothesis is denoted by H 0 : µ = µ 0. There are a variety of options for the alternative hypothesis. Commonly used alternative hypotheses are: (A) H 1 : µ = µ 1 > µ 0 (B) H 1 : µ = µ 1 < µ 0 (µ 1 fixed constant) (µ 1 fixed constant) (C) H 1 : µ > µ 0 (D) H 1 : µ < µ 0 (E) H 1 : µ µ 0. Example Suppose the marks for a particular test are believed to follow a N(µ, 100) distribution, and the null hypothesis is H 0 : µ = 50. In which category (A) - (E) are each of the following alternative hypotheses: 1. H 1 : µ < 50; 2. H 1 : µ = 57; 3. H 1 : µ 50?

6 4 HYPOTHESIS TESTING 54 Alternative (E) is the most commonly used, and the easiest to justify in most real life situations. All the others assume some knowledge which it is usually unrealistic to assume. The null and alternative hypotheses are treated in the following way: we adopt the null hypothesis unless there is evidence against it. The test statistic we choose to use for a single Normal sample is X, the sample mean. It makes sense to test a hypothesis about the population mean µ using the sample mean X, but more than this, we know the distribution of X under the null hypothesis, which is crucial. If H 0 is true, X 1,...,X n are i.i.d. N(µ 0,σ 2 ) random variables, and so X N ( µ 0,σ 2 /n ) Z = X µ 0 σ/ n N(0,1). We now need to decide for which values of the test statistic we will reject H 0. These values will comprise the rejection region A 1. We reject H 0 in cases (A) or (C) : if Z is sufficiently far into the right-hand tail; (B) or (D) : if Z is sufficiently far into the left-hand tail; (E) : if Z is sufficiently far into either tail. In case (E) the rejection region is split between the tails of the distribution giving a twotailed test. The other cases are one-tailed tests. If P(Type I error)=α, the test is said to have significance level α. Commonly used significance levels are 0.05 (5%), 0.01 (1%) and (0.1%). Once the significance level is chosen, the rejection region is precisely determined.

7 4 HYPOTHESIS TESTING 55 Example Forα = 0.05, calculatetherejection regions(interms ofz) foreach categoryofalternative hypothesis (A) - (E). Example The widths (mm) of 64 beetles chosen from a particular locality were measured and the sample mean was found to be x = Previous extensive measurements of beetles of the same species had shown the widths to be Normally distributed with mean 23mm and variance 16mm. Test at the 5% level whether or not the beetles from the chosen locality have a different mean width from the main population, assuming that they have the same variance.

8 4 HYPOTHESIS TESTING 56

9 4 HYPOTHESIS TESTING cont. A single Normal sample with unknown variance σ 2 Nowweconsider hypothesis testsaboutµwherex 1,X 2,...,X n isani.i.d. randomsample from a N(µ,σ 2 ) distribution, and σ 2 is unknown. This is usually more realistic than assuming we know σ 2, but it is also a more complex problem. We have to estimate µ in the presence of the nuisance parameter σ 2. The solution is to replace σ 2 with a suitable estimate; here we use the sample variance S 2. Example Cola makers test new recipes for loss of sweetness during storage. For one particular recipe, ten trained tasters rate the sweetness before and after, enabling us to calculate the change (sweetness after storage minus sweetness before storage), as follows: Before After Change Is there evidence that in general, the storage causes the cola to lose sweetness?

10 4 HYPOTHESIS TESTING 58 Solution/cont. When we knew σ 2, we used the test statistic which we know has a N(0,1) distribution. Z = X µ 0 σ/ n, Now we are estimating σ 2 using S 2, so our test statistic becomes T = X µ 0 s/ n, and this has a slightly different distribution, called the Student t distribution, or just the t distribution... Definition 4.2.1: The Student t distribution If Z N(0,1) and U χ 2 n are independent random variables then T n = Z U/n has a Student t-distribution on n-degrees of freedom. The distribution is denoted by t n.

11 4 HYPOTHESIS TESTING 59 Example Sketch the t distribution with (a) 1; (b) 5; (c) 100 degrees of freedom. Figure 2: the t 1, t 5 and t 100 distributions pdf pdf pdf t 1 0 t 5 0 t

12 4 HYPOTHESIS TESTING 60 The t-distribution with n degrees of freedom has a p.d.f. which is symmetric and bellshaped, like the Normal, but with somewhat thicker tails. Smaller values of n correspond to the thickest tails. Larger values of n cause the t n distribution to be more like the Normal distribution. All we have to be able to do is to use statistical tables or R to look up the appropriate tail probability, since the distribution of our test statistic is given by: Example T n 1 = X µ 0 s/ n t n 1. For the cola example in the test statistic was t = 2.697, and the sample size was n = 10. Carry out the test of against H 0 : µ = 0 (no loss in sweetness); H 1 : µ < 0 (some loss in sweetness).

13 4 HYPOTHESIS TESTING Hypothesis test for two Normal means: two sample t test Now suppose we have two samples (x 1,x 2,...,x n1 ) and (x n1 +1,x n1 +2,...,x n1 +n 2 ), i.e. samples of sizes n 1 and n 2 from two different populations. We are interested in whether the two population means are equal. Assuming that the data are sampled from Normally distributed populations with equal variance, σ 2, in each population, then if we want to test H 0 : µ 1 = µ 2 versus H 1 : µ 1 µ 2 where µ 1 and µ 2 are the means of each population, we can perform a t-test with test statistic given by... t = x 1 x 2, where s = 1 s n n 2 (n 1 1)s 2 1 +(n 2 1)s 2 2, n 1 +n 2 2 where x 1, x 2, s 1 and s 2 are the sample means and standard deviations from each population. Here s = s 2 is the pooled estimate of the common standard deviation σ. If the null hypothesis is true, then the test statistic comes from a t distribution on n 1 +n 2 2 degrees of freedom, so we use the tables for t n1 +n 2 2 to carry out the test. This test is called the two sample t test.

14 4 HYPOTHESIS TESTING 62 Example Consider the lifetime of two brands of light bulbs. For a random sample of n 1 = 12 bulbs of one brand the mean bulb life is x 1 = 3,400 hours with a sample standard deviation of s 1 = 240 hours. Forthesecond brandofbulbsthemeanbulblifeforasampleofn 2 = 8bulbsis x 2 = 2,800 hours with s 2 = 210 hours. We assume that distribution of bulb life is approximately Normal, and the standard deviations of the two populations are assumed to be equal. Test using a two sample t-test at the 1% level. H 0 : µ 1 = µ 2 versus H 1 : µ 1 µ 2

15 4 HYPOTHESIS TESTING Two Normal populations: testing the assumption of equal variances In Section 4.3 we had to make the assumption that our two Normal populations had equal variance σ 2. Here we see how we can carry out a hypothesis test to check this assumption! We denote the two population variances by σ 2 1 and σ 2 2. We wish to test H 0 : σ 2 1 = σ2 2 versus H 1 : σ 2 1 σ2 2. Notice that these hypotheses don t make any assumptions about the values of µ 1 and µ 2. If the null hypothesis is true, then the ratio of sample variances S 2 1 S 2 2 will have a distribution called the F-distribution, on n 1 1 and n 2 1 degrees of freedom. Definition 4.4.1: The F distribution If U and V are independent chi-square random variables such that U χ 2 r and V χ2 s, then F = U/r V/s has an F distribution on r and s degrees of freedom. The distribution is denoted by F r,s. Note that the F distribution is characterized by two separate measures of degrees of freedom: r corresponds to the numerator and s corresponds to the denominator. Printed F tables are available, and of course we can always use R (except in an exam!). Note that it follows immediately that the reciprocal ratio of sample variances S 2 1 will have an F distribution on n 2 1 and n 1 1 degrees of freedom. S 2 2 In practice, we carry out the hypothesis test for equal variances as follows. We will only consider the case of the two sided alternative ( not equal ), giving rise to a two tailed test. In this case it is sensible to reject H 0 if either s 2 1/s 2 2 or s 2 2/s 2 1 is large. We form our test statistic as { s 2 F = max 1, s2 2 s 2 2 s 2 1 and compare this with F r,s tables, where if s 1 1 > s2 2 we set r = n 1 1 and s = n 2 1, while if s 1 2 > s2 1 we set r = n 2 1 and s = n 1 1. To account for the fact that under H 0, these two outcomes could happen with equal probability, the significance level of the test is *double* the upper tail probability of the F distribution (obtained from tables or R). },

16 4 HYPOTHESIS TESTING 64 Example For the data in Example 4.3.1, test the assumption that the standard deviations of the two populations are equal.

17 4 HYPOTHESIS TESTING Inference for a single Binomial proportion (r not small!) Here we consider the situation where we have a single observation x from a Binomial random variable X Bin(r,θ), and we are interested in testing hypotheses about θ. Note that x can be viewed as the number of successes from r independent trials, each with success probability θ. In this section we consider the case where r is not small, i.e. r > 20. We will test H 0 : θ = θ 0 against an alternative from one of the categories (A) to (E) above. Example UK survey of sexual behaviour: in 2004/05, 11% of UK residents aged claimed to have had more than one sexual partner. Suppose that in , a random sample of 600 UK residents in the age group shows that 83 had more than one sexual partner. Is this evidence for an increase in the population proportion having more than one sexual partner? Formulate this problem as a hypothesis test.

18 4 HYPOTHESIS TESTING 66 We need to derive a test statistic whose distribution we can evaluate conditional on H 0 being true. We use the Normal approximation to the Binomial distribution. I.e. if X Bin(r,θ), with r > 20, then to a reasonable approximation X N[rθ,rθ(1 θ)]. (Note that the approximation involves rounding the outcome of a Normal random variable to the nearest integer! See below.) Now suppose the null hypothesis H 0 is true, i.e. θ = θ 0. Then the Normal approximation implies X N[rθ 0,rθ 0 (1 θ 0 )], and hence the test statistic Z = X rθ 0 rθ0 (1 θ 0 ) has a N(0,1) distribution. This means we can carry out a one sample z test exactly as we did in Section 4.2. N.B. because of the rounding issue, it makes sense to replace x in the test statistic by x 0.5 when x > rθ 0, and by x+0.5 when x < rθ 0. This is called a continuity correction. Example Forthesexual behaviour datainexample 4.5.1wehaver = 600, wehaveobserved x = 83, and we want to test H 0 : θ = 0.11 against H 1 : θ > Carry out the hypothesis test.

19 4 HYPOTHESIS TESTING 67 Notes on significance levels and p values 1. If you are not told what level of significance to use, a sensible procedure is to test at the 5% level. If not significant then stop, otherwise test at the 1% level. If not significant then stop, otherwise test at the 0.1% level. 2. If you have access to the p-value, e.g. from Normal tables, or from R (see Exercises 4B Questions 1 and 2) then you immediately have the result of a hypothesis test at any given significance level. E.g. in Example immediately above, we had p = It follows immediately that our test is significant at 5% but not at 1%, because 0.05 > p > 0.01.

20 4 HYPOTHESIS TESTING Inference for two Binomial proportions (samples not small!) Example Consider a survey of employment carried out seperately in Northern England and Scotland, among people who had left school six months earlier. Suppose we obtain the following data: Unemployed Employed Scotland Northern England Total In general we have two independent samples of size n 1 and n 2, with each observation classified as success or failure: Sample 1 Sample 2 Total Success O 11 O 12 R 1 = O 11 +O 12 Failure O 21 O 22 R 2 = O 21 +O 22 n 1 n 2 n = n 1 +n 2 Assuming all observations are independent, and that the success probability is constant within each sample, we have two Binomial samples. Suppose that the true probabilities of success are θ 1 and θ 2. We wish to test H 0 : θ 1 = θ 2 versus H 1 : θ 1 θ 2. As always with a hypothesis test, we need to find a test statistic whose distribution is known when H 0 is true. Now if H 0 is true, then θ 1 = θ 2 = θ, say. The combined samples give the number of successes in n 1 + n 2 trials, in each of which there is a probability θ of a success. So we may estimate θ by ˆθ = R 1 /n, where R 1 = O 11 +O 12 (total for first row) n = n 1 +n 2 (grand total).

21 4 HYPOTHESIS TESTING 69 Hence, under H 0, the expected number of successes in each of the samples is where E 11 = n 1R 1 n ; E 21 = n 1R 2 n ; E 12 = n 2R 1 n ; E 22 = n 2R 2 n. R 2 = O 21 +O 22 (total for second row). To measure how closely the expected values match the observed values we calculate the test statistic 2 2 X 2 (O ij E ij ) 2 =. E ij i=1 j=1 Under H 0, X 2 has an asymptotic distribution which is a χ 2 1 distribution (a chi square distribution with 1 degree of freedom ). Definition 4.6.1: The chi square distribution χ 2 n If Z 1,...,Z n are independent N(0,1) random variables, then X 2 = n i=1 Z 2 i has a chi square distribution on n-degrees of freedom. The distribution is denoted by χ 2 n. If H 0 is true, the observed values should be close to the expected values, and so X 2 will be small. Hence we reject H 0 if X 2 is large enough, using Tables (or R).

22 4 HYPOTHESIS TESTING 70 Example Consider the data in Example 4.6.1: Test H 0 : the unemployment rates are equal against H 1 : the unemployment rates are not equal.

23 4 HYPOTHESIS TESTING 71 Notes 1. The method we just described for 2 2 tables also works for r c tables, that is tables with r rows and c columns. The test statistic is given by X 2 = r i=1 c j=1 (O ij E ij ) 2 E ij, and this is compared with a chi-square distribution with (r 1) (c 1) degrees of freedom, i.e. χ 2 (r 1)(c 1). 2. Since deviation from what is expected under H 0 always corresponds to higher values of X 2, chi square tests for 2 proportions (and for r c contingency tables) are ***always*** 1 tailed, and always use the upper tail of the chi square distribution!!!

24 4 HYPOTHESIS TESTING The relationship between hypothesis tests and confidence intervals Every hypothesis test we carry out has a corresponding confidence interval associated with it! Example For the beetle widths given in Example 4.2.3, calculate a 95% confidence interval for the population mean µ. Looking back at that example, we can deduce immediately that 23 also lies outside the 99% confidence interval, and the 99.9% confidence interval. (Exercise: check this!) The general rule is: The 100(1 α)% confidence interval consists precisely of all those values which would not be rejected at the 100α% significance level.

25 4 HYPOTHESIS TESTING Hypothesis tests: size and power function Hypothesis tests can be described in terms of their size and power. Definition 4.8.1: the size of a hypothesis test Consider a particular hypothesis test on a single parameter θ. We define the size of the test to be sup{pr(reject H 0 )}. θ ω Note that for a simple null hypothesis, this is just the probability we reject H 0 if it s true, i.e. the probability of a Type I error. For a composite null hypothesis, it is the supremum of this rejection probability over all the values of θ for which the null hypothesis holds. Definition 4.8.2: the power function for a hypothesis test The power K(θ) is the probability of rejecting H 0, considered as a function of θ. A plot of the power function is helpful in determining how good our test is at rejecting the null hypothesis when it is false. Informally, the power of a test is often used to refer to the probability that it will reject the null hypothesis when it is false. However from our different categories of alternative hypothesis (A) - (E), this only makes real sense for (A) and (B), i.e. when we are comparing two simple hypotheses.

26 4 HYPOTHESIS TESTING 74 Example Suppose X 1,...,X 4 is a random sample from X N(µ,36), and we wish to test H 0 : µ = 10 against H 1 : µ > 10. Note that this is a one tailed alternative. Now suppose we base our rejection region on the value of X; specifically we construct it as A1 = {X : X > 17}. (a) Plot the power function for this test in the range 10 µ 20. (b) What is the size of this test? (c) What would be the power of the test if the alternative was, in fact, H 1 : µ = 22?

27 4 HYPOTHESIS TESTING 75 (cont.) Choice of rejection region In Example we found that our rejection region gave a test with desirable properties: a standard size of 1%, and a well-defined power function. So how can we design such a test ourselves? Fortunately there is a very useful theorem which helps us to define an optimal rejection region...

28 4 HYPOTHESIS TESTING 76 The Neyman-Pearson Lemma Suppose we have a random sample x 1,x 2,...,x n from a random variable X with density f X (x θ), and we wish to test H 0 : θ = θ 0 against the simple alternative H 1 : θ = θ 1. Consider the Likelihood Ratio defined as Λ(x) = L(θ 0 x) L(θ 1 x). Suppose we define a test by rejecting H 0 in favour of H 1 if Λ(x) is small enough. Specifically, suppose we choose a cut off point η such that Pr(Λ(x) η H 0 ) = α. Then the test based on the rejection region A 1 = {x : Λ(x) η} is the most powerful test of size α. Now suppose we have a composite alternative hypothesis H 1 : θ Θ 1. If the test is the most powerful for all θ 1 Θ 1, then it is said to be the uniformly most powerful (UMP) test for alternatives in the set Θ 1. Notes 1. Informally, the Neyman-Pearson Lemma says that if we base our test on the value of the likelihood ratio, then we get the best possible test (in the sense of being the most powerful). 2. Note that if we need to define a rejection region in terms of it is often easier to work with Λ(x) = L(θ 0 x) L(θ 1 x), log[λ(x)] = log[l(θ 0 x)] log[l(θ 1 x)]. Example SupposeX 1,X 2,...,X n isarandomsamplefroman(µ,σ 2 )distributionwhereµisknown, and we wish to test where σ 2 0 < σ 2 1. H 0 : σ 2 = σ 2 0 versus H 1 : σ 2 = σ 2 1, Find an appropriate test statistic on which to base a rejection region.

29 4 HYPOTHESIS TESTING 77

30 4 HYPOTHESIS TESTING Small sample methods In this section we consider statistical inference (estimation and hypothesis testing) in situations where the sample size is small. The crucial change from large sample methods is that we can no longer rely on the asymptotic distribution of either the maximum likelihood estimator, or the test statistic, in a hypothesis test. In fact the cases for one and two Normal means have already been dealt with, because the adjustments made to deal with unknown variance (t tests!) work for arbitrarily small samples. The cases which need special treatment are the cases of (a) inference on one Binomial proportion, and (b) the comparison of two Binomial proportions Inference for a single Binomial proportion (r is small!) Suppose we have a single observation x from a Binomial random variable X Bin(r,θ), and we want to test hypotheses about θ. This is the same kind of problem as we considered in Section 4.5, but this time we assume that the number of trials r is small, i.e. 20. The crucial difference is that the Normal approximation is now too poor to use, and we should use the Binomial distribution directly (using Tables or R). The fact that we are now working with a genuinely discrete distribution leads to a complication: we cannot carry out a test precisely for any specified significance level; we have to use the nearest approximate significance level. Example A leading cat food manufacturer has a slogan which could be interpreted as follows: 80% of cats prefer our product. In an experiment to test this, 20 cats are each given the choice between the product in question, Brand W, and the leading market competitor, Brand X. Result: 12 cats go for Brand W, and 8 cats go for Brand X. Is this evidence against Brand W s claim?

31 4 HYPOTHESIS TESTING 79

32 4 HYPOTHESIS TESTING Inference for two Binomial proportions (small samples!) Here we consider the same kind of problem as in Section 4.6, i.e. two Binomial proportions, with the data arranged in a 2 2 contingency table. Here we consider the case when one or both samples are small. Example A small study into the dieting habits of teenagers is undertaken, to investigate whether or not the proportions of males and females who diet are equal. Suppose the population proportions of males and females who are dieting at any one time are denoted by θ M and θ F respectively. We wish to test: H 0 : θ M = θ F against H 1 : θ M θ F. A random sample of 12 boys and 12 girls is selected, and we ascertain whether each individual is currently on a diet. Data: dieting not dieting Total Table 4.1 boys girls Total

33 4 HYPOTHESIS TESTING 81 It certainly appears that in the population, girls are more likely to be dieting, since in our sample: 9 out of 12 girls are dieting; 1 out of 12 boys are dieting. The question is: How significant are these results? In other words, how much evidence do we have against H 0 : θ M = θ F? The way we answer this is that we assume the row totals and the column totals are fixed at the observed values. We then assume that H 0 is true (as ever!) and we ask, how unlikely is the result we have observed? In other words: If we were to choose 10 of the teenagers at random, what is the probability that 9 of them would be among the 12 girls, and only 1 from among the 12 boys? The p value for this test will be the probability of all outcomes which are as extreme as this one, or more so... We introduce the notation: dieting not dieting Total boys girls Total

34 4 HYPOTHESIS TESTING 82 Table 4.2 dieting not dieting Total boys girls Total

35 4 HYPOTHESIS TESTING 83 Review of Section 4 In this section we have: 1. Introduced the principles of hypothesis testing. 2. Seen how to carry out hypothesis tests in some specific cases when the sample size n is reasonably large: (a) the mean of a single Normal population (variance known and unknown); (b) the means of two Normal populations (variances assumed equal); (c) the variances of two Normal populations; (d) the success probability for a single Binomial proportion; (e) the success probabilities for two Binomial proportions; 3. Introduced three new probability distributions needed to carry out the tests: the Student t distribution, the F distribution and the chi square distribution. 4. Learned how to use Statistical Tables to carry out the tests at specific significance levels. 5. Learned how to use R to do some of these tests, and to interpret the precise p value obtained. 6. Understood the relationship between hypothesis tests and confidence intervals. 7. Considered the properties of hypothesis tests, namely size and power. 8. Seen how we may construct the most powerful tests using the Neyman-Pearson Lemma. 9. Seen how to carry out hypothesis tests relating to the success probability of a single Binomial distribution when the number of trials is small ( 20); 10. Seen how to carry out hypothesis tests to compare two Binomial proportions when the sample sizes in a 2 2 contingency table are small. [Note that for the cases of one Normal mean and two Normal means, the methods we developed in Section 3 (z-tests and t-tests) already work for arbitrarily small samples.]

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information

Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution.

Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution. Hypothesis Testing Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution. Suppose the family of population distributions is indexed

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes Neyman-Pearson paradigm. Suppose that a researcher is interested in whether the new drug works. The process of determining whether the outcome of the experiment points to yes or no is called hypothesis

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Chapters 10. Hypothesis Testing

Chapters 10. Hypothesis Testing Chapters 10. Hypothesis Testing Some examples of hypothesis testing 1. Toss a coin 100 times and get 62 heads. Is this coin a fair coin? 2. Is the new treatment on blood pressure more effective than the

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1 4 Hypothesis testing 4. Simple hypotheses A computer tries to distinguish between two sources of signals. Both sources emit independent signals with normally distributed intensity, the signals of the first

More information

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

Hypothesis Testing. ECE 3530 Spring Antonio Paiva Hypothesis Testing ECE 3530 Spring 2010 Antonio Paiva What is hypothesis testing? A statistical hypothesis is an assertion or conjecture concerning one or more populations. To prove that a hypothesis is

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

40.530: Statistics. Professor Chen Zehua. Singapore University of Design and Technology

40.530: Statistics. Professor Chen Zehua. Singapore University of Design and Technology Singapore University of Design and Technology Lecture 9: Hypothesis testing, uniformly most powerful tests. The Neyman-Pearson framework Let P be the family of distributions of concern. The Neyman-Pearson

More information

Topic 19 Extensions on the Likelihood Ratio

Topic 19 Extensions on the Likelihood Ratio Topic 19 Extensions on the Likelihood Ratio Two-Sided Tests 1 / 12 Outline Overview Normal Observations Power Analysis 2 / 12 Overview The likelihood ratio test is a popular choice for composite hypothesis

More information

5.2 Tests of Significance

5.2 Tests of Significance 5.2 Tests of Significance Example 5.7. Diet colas use artificial sweeteners to avoid sugar. Colas with artificial sweeteners gradually lose their sweetness over time. Manufacturers therefore test new colas

More information

Hypothesis Testing Chap 10p460

Hypothesis Testing Chap 10p460 Hypothesis Testing Chap 1p46 Elements of a statistical test p462 - Null hypothesis - Alternative hypothesis - Test Statistic - Rejection region Rejection Region p462 The rejection region (RR) specifies

More information

STAT 830 Hypothesis Testing

STAT 830 Hypothesis Testing STAT 830 Hypothesis Testing Richard Lockhart Simon Fraser University STAT 830 Fall 2018 Richard Lockhart (Simon Fraser University) STAT 830 Hypothesis Testing STAT 830 Fall 2018 1 / 30 Purposes of These

More information

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided Let us first identify some classes of hypotheses. simple versus simple H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided H 0 : θ θ 0 versus H 1 : θ > θ 0. (2) two-sided; null on extremes H 0 : θ θ 1 or

More information

Topic 10: Hypothesis Testing

Topic 10: Hypothesis Testing Topic 10: Hypothesis Testing Course 003, 2016 Page 0 The Problem of Hypothesis Testing A statistical hypothesis is an assertion or conjecture about the probability distribution of one or more random variables.

More information

Chapters 10. Hypothesis Testing

Chapters 10. Hypothesis Testing Chapters 10. Hypothesis Testing Some examples of hypothesis testing 1. Toss a coin 100 times and get 62 heads. Is this coin a fair coin? 2. Is the new treatment more effective than the old one? 3. Quality

More information

Statistics 3858 : Contingency Tables

Statistics 3858 : Contingency Tables Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson

More information

Chapter 5: HYPOTHESIS TESTING

Chapter 5: HYPOTHESIS TESTING MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate

More information

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,

More information

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1, Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem

More information

Topic 10: Hypothesis Testing

Topic 10: Hypothesis Testing Topic 10: Hypothesis Testing Course 003, 2017 Page 0 The Problem of Hypothesis Testing A statistical hypothesis is an assertion or conjecture about the probability distribution of one or more random variables.

More information

STA 732: Inference. Notes 2. Neyman-Pearsonian Classical Hypothesis Testing B&D 4

STA 732: Inference. Notes 2. Neyman-Pearsonian Classical Hypothesis Testing B&D 4 STA 73: Inference Notes. Neyman-Pearsonian Classical Hypothesis Testing B&D 4 1 Testing as a rule Fisher s quantification of extremeness of observed evidence clearly lacked rigorous mathematical interpretation.

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

F79SM STATISTICAL METHODS

F79SM STATISTICAL METHODS F79SM STATISTICAL METHODS SUMMARY NOTES 9 Hypothesis testing 9.1 Introduction As before we have a random sample x of size n of a population r.v. X with pdf/pf f(x;θ). The distribution we assign to X is

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

STAT 830 Hypothesis Testing

STAT 830 Hypothesis Testing STAT 830 Hypothesis Testing Hypothesis testing is a statistical problem where you must choose, on the basis of data X, between two alternatives. We formalize this as the problem of choosing between two

More information

TUTORIAL 8 SOLUTIONS #

TUTORIAL 8 SOLUTIONS # TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level

More information

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) Gujarati D. Basic Econometrics, Appendix A.8 Barrow M. Statistics

More information

ST495: Survival Analysis: Hypothesis testing and confidence intervals

ST495: Survival Analysis: Hypothesis testing and confidence intervals ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014 I remember that one fateful day when Coach took

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

Hypothesis testing: theory and methods

Hypothesis testing: theory and methods Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable

More information

Ch. 5 Hypothesis Testing

Ch. 5 Hypothesis Testing Ch. 5 Hypothesis Testing The current framework of hypothesis testing is largely due to the work of Neyman and Pearson in the late 1920s, early 30s, complementing Fisher s work on estimation. As in estimation,

More information

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Political Science 236 Hypothesis Testing: Review and Bootstrapping Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The

More information

STAT 801: Mathematical Statistics. Hypothesis Testing

STAT 801: Mathematical Statistics. Hypothesis Testing STAT 801: Mathematical Statistics Hypothesis Testing Hypothesis testing: a statistical problem where you must choose, on the basis o data X, between two alternatives. We ormalize this as the problem o

More information

CH.9 Tests of Hypotheses for a Single Sample

CH.9 Tests of Hypotheses for a Single Sample CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance

More information

With our knowledge of interval estimation, we can consider hypothesis tests

With our knowledge of interval estimation, we can consider hypothesis tests Chapter 10 Hypothesis Testing 10.1 Testing Hypotheses With our knowledge of interval estimation, we can consider hypothesis tests An Example of an Hypothesis Test: Statisticians at Employment and Immigration

More information

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80 The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80 71. Decide in each case whether the hypothesis is simple

More information

MTMS Mathematical Statistics

MTMS Mathematical Statistics MTMS.01.099 Mathematical Statistics Lecture 12. Hypothesis testing. Power function. Approximation of Normal distribution and application to Binomial distribution Tõnu Kollo Fall 2016 Hypothesis Testing

More information

STAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data.

STAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data. STAT 518 --- Section 3.4: The Sign Test The sign test, as we will typically use it, is a method for analyzing paired data. Examples of Paired Data: Similar subjects are paired off and one of two treatments

More information

Statistical Inference. Hypothesis Testing

Statistical Inference. Hypothesis Testing Statistical Inference Hypothesis Testing Previously, we introduced the point and interval estimation of an unknown parameter(s), say µ and σ 2. However, in practice, the problem confronting the scientist

More information

Unit 22 deals with statistical inference, which uses the concepts of probability to explain

Unit 22 deals with statistical inference, which uses the concepts of probability to explain UNIT 22 STATISTICAL INFERENCE: TESTS OF HYPOTHESIS Structure 22.1 Introduction 22.2 Objectives 22.3 Statistical Inference 22.4 Cases 22.5 Tests of Significance 22.6 Conclusion 22.7 Further Reading 22.1

More information

Some General Types of Tests

Some General Types of Tests Some General Types of Tests We may not be able to find a UMP or UMPU test in a given situation. In that case, we may use test of some general class of tests that often have good asymptotic properties.

More information

The Components of a Statistical Hypothesis Testing Problem

The Components of a Statistical Hypothesis Testing Problem Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one

More information

Statistics 135 Fall 2008 Final Exam

Statistics 135 Fall 2008 Final Exam Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations

More information

Review. December 4 th, Review

Review. December 4 th, Review December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter

More information

simple if it completely specifies the density of x

simple if it completely specifies the density of x 3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely

More information

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes Tests Concerning a Single Sample 41.2 Introduction This Section introduces you to the basic ideas of hypothesis testing in a non-mathematical way by using a problem solving approach to highlight the concepts

More information

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Partitioning the Parameter Space. Topic 18 Composite Hypotheses Topic 18 Composite Hypotheses Partitioning the Parameter Space 1 / 10 Outline Partitioning the Parameter Space 2 / 10 Partitioning the Parameter Space Simple hypotheses limit us to a decision between one

More information

McGill University. Faculty of Science. Department of Mathematics and Statistics. Part A Examination. Statistics: Theory Paper

McGill University. Faculty of Science. Department of Mathematics and Statistics. Part A Examination. Statistics: Theory Paper McGill University Faculty of Science Department of Mathematics and Statistics Part A Examination Statistics: Theory Paper Date: 10th May 2015 Instructions Time: 1pm-5pm Answer only two questions from Section

More information

Chapter 7. Hypothesis Testing

Chapter 7. Hypothesis Testing Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Chapter 9: Hypothesis Testing Sections

Chapter 9: Hypothesis Testing Sections Chapter 9: Hypothesis Testing Sections 9.1 Problems of Testing Hypotheses 9.2 Testing Simple Hypotheses 9.3 Uniformly Most Powerful Tests Skip: 9.4 Two-Sided Alternatives 9.6 Comparing the Means of Two

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

8: Hypothesis Testing

8: Hypothesis Testing Some definitions 8: Hypothesis Testing. Simple, compound, null and alternative hypotheses In test theory one distinguishes between simple hypotheses and compound hypotheses. A simple hypothesis Examples:

More information

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc. Hypothesis Tests and Estimation for Population Variances 11-1 Learning Outcomes Outcome 1. Formulate and carry out hypothesis tests for a single population variance. Outcome 2. Develop and interpret confidence

More information

Solution: First note that the power function of the test is given as follows,

Solution: First note that the power function of the test is given as follows, Problem 4.5.8: Assume the life of a tire given by X is distributed N(θ, 5000 ) Past experience indicates that θ = 30000. The manufacturere claims the tires made by a new process have mean θ > 30000. Is

More information

http://www.math.uah.edu/stat/hypothesis/.xhtml 1 of 5 7/29/2009 3:14 PM Virtual Laboratories > 9. Hy pothesis Testing > 1 2 3 4 5 6 7 1. The Basic Statistical Model As usual, our starting point is a random

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting Estimating the accuracy of a hypothesis Setting Assume a binary classification setting Assume input/output pairs (x, y) are sampled from an unknown probability distribution D = p(x, y) Train a binary classifier

More information

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation

More information

BEST TESTS. Abstract. We will discuss the Neymann-Pearson theorem and certain best test where the power function is optimized.

BEST TESTS. Abstract. We will discuss the Neymann-Pearson theorem and certain best test where the power function is optimized. BEST TESTS Abstract. We will discuss the Neymann-Pearson theorem and certain best test where the power function is optimized. 1. Most powerful test Let {f θ } θ Θ be a family of pdfs. We will consider

More information

Topic 3: Hypothesis Testing

Topic 3: Hypothesis Testing CS 8850: Advanced Machine Learning Fall 07 Topic 3: Hypothesis Testing Instructor: Daniel L. Pimentel-Alarcón c Copyright 07 3. Introduction One of the simplest inference problems is that of deciding between

More information

Lecture 10: Generalized likelihood ratio test

Lecture 10: Generalized likelihood ratio test Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual

More information

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:

More information

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains: CHAPTER 8 Test of Hypotheses Based on a Single Sample Hypothesis testing is the method that decide which of two contradictory claims about the parameter is correct. Here the parameters of interest are

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Hypothesis Testing. A rule for making the required choice can be described in two ways: called the rejection or critical region of the test.

Hypothesis Testing. A rule for making the required choice can be described in two ways: called the rejection or critical region of the test. Hypothesis Testing Hypothesis testing is a statistical problem where you must choose, on the basis of data X, between two alternatives. We formalize this as the problem of choosing between two hypotheses:

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should 11 CHI-SQUARED Chapter 11 Chi-squared Objectives After studying this chapter you should be able to use the χ 2 distribution to test if a set of observations fits an appropriate model; know how to calculate

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

Visual interpretation with normal approximation

Visual interpretation with normal approximation Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation

More information

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

We know from STAT.1030 that the relevant test statistic for equality of proportions is: 2. Chi 2 -tests for equality of proportions Introduction: Two Samples Consider comparing the sample proportions p 1 and p 2 in independent random samples of size n 1 and n 2 out of two populations which

More information

Methods for Statistical Prediction Financial Time Series I. Topic 1: Review on Hypothesis Testing

Methods for Statistical Prediction Financial Time Series I. Topic 1: Review on Hypothesis Testing Methods for Statistical Prediction Financial Time Series I Topic 1: Review on Hypothesis Testing Hung Chen Department of Mathematics National Taiwan University 9/26/2002 OUTLINE 1. Fundamental Concepts

More information

Chapter 10. Hypothesis Testing (I)

Chapter 10. Hypothesis Testing (I) Chapter 10. Hypothesis Testing (I) Hypothesis Testing, together with statistical estimation, are the two most frequently used statistical inference methods. It addresses a different type of practical problems

More information

How do we compare the relative performance among competing models?

How do we compare the relative performance among competing models? How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model

More information

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS COURSE: CBS 221 DISCLAIMER The contents of this document are intended for practice and leaning purposes at the undergraduate

More information

Chapter 12: Inference about One Population

Chapter 12: Inference about One Population Chapter 1: Inference about One Population 1.1 Introduction In this chapter, we presented the statistical inference methods used when the problem objective is to describe a single population. Sections 1.

More information

VTU Edusat Programme 16

VTU Edusat Programme 16 VTU Edusat Programme 16 Subject : Engineering Mathematics Sub Code: 10MAT41 UNIT 8: Sampling Theory Dr. K.S.Basavarajappa Professor & Head Department of Mathematics Bapuji Institute of Engineering and

More information

About the HELM Project HELM (Helping Engineers Learn Mathematics) materials were the outcome of a three-year curriculum development project

About the HELM Project HELM (Helping Engineers Learn Mathematics) materials were the outcome of a three-year curriculum development project About the HELM Project HELM (Helping Engineers Learn Mathematics) materials were the outcome of a three-year curriculum development project undertaken by a consortium of five English universities led by

More information

Introduction to Statistics

Introduction to Statistics MTH4106 Introduction to Statistics Notes 15 Spring 2013 Testing hypotheses about the mean Earlier, we saw how to test hypotheses about a proportion, using properties of the Binomial distribution It is

More information

Chapter 9 Inferences from Two Samples

Chapter 9 Inferences from Two Samples Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

HYPOTHESIS TESTING: FREQUENTIST APPROACH. HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

LECTURE 5 HYPOTHESIS TESTING

LECTURE 5 HYPOTHESIS TESTING October 25, 2016 LECTURE 5 HYPOTHESIS TESTING Basic concepts In this lecture we continue to discuss the normal classical linear regression defined by Assumptions A1-A5. Let θ Θ R d be a parameter of interest.

More information

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual Question 1. Suppose you want to estimate the percentage of

More information