PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1


In previous lectures we encountered the problems of estimating an unknown population mean and constructing confidence intervals for the mean. Our example was cholesterol levels for people who go on a new diet that may help lower cholesterol. If we have a sample of these people and observe their cholesterol levels after they have stayed on the diet for some time, we can estimate the expected cholesterol level for people on this diet.

Assume $X_1, X_2, \ldots, X_n \sim N(\mu, \sigma^2)$. We can estimate $\mu$ with $\bar{X}$ and construct a $(1-\alpha)100\%$ confidence interval

$$\bar{X} \pm t_{\alpha/2,\,df=n-1}\, S/\sqrt{n}$$

The numerical example we gave was a sample of size 10: $\bar{X} = 180.8$, $S_X^2 = 93$. A 95% confidence interval for $\mu$ is

$$\bar{X} \pm t_{\alpha/2,\,df=n-1}\, S/\sqrt{n} = 180.8 \pm 2.262\sqrt{93/10} = (173.9,\ 187.7).$$

If we know that the mean cholesterol level in the general population is 200 (for example, you looked it up on the CDC website), another question we may ask is: Is the sample mean we observe (180.8) compatible with the hypothesis that people on the diet actually have the same mean cholesterol level as the general population?

What do we mean by compatible? In other words, if we had sampled 10 people randomly from the general population, instead of the diet sub-population, could we have observed 180.8 as well?
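The interval above can be checked with a few lines of arithmetic. This is a minimal sketch using only the summary numbers from the slide; the critical value 2.262 for $t_{.025,\,df=9}$ is assumed to come from a t-table rather than being computed.

```python
import math

# Summary statistics from the slide: n = 10, sample mean 180.8, sample variance 93.
n, xbar, s2 = 10, 180.8, 93.0

# t_{.025, df=9}; assumed to be read from a t-table (not computed here).
t_crit = 2.262

half_width = t_crit * math.sqrt(s2 / n)
ci = (xbar - half_width, xbar + half_width)
print(f"95% CI: ({ci[0]:.1f}, {ci[1]:.1f})")
```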

To better illustrate the logic behind hypothesis testing, we will study a side example on whether data are compatible with a hypothesis. Suppose we have a coin and the hypothesis is that it is a regular coin, so that heads and tails have equal probability when I flip it. We refer to this hypothesis as the null hypothesis and denote it by $H_0$.

If I now flip it 10 times and see all heads, you may start doubting $H_0$, because it seems unlikely to observe 10 heads in a row if $H_0$ were true. How unlikely is it? The probability of observing 10 heads out of 10 flips under $H_0$ (a fair coin) is

$$\binom{10}{10}(.5)^{10}(1-.5)^{0} = .5^{10} \approx .001.$$

This is a very small probability: not impossible, but rather unlikely. The data do not appear to be compatible with the hypothesis.

There are two possible explanations: either something really unlikely happened (unlikely $\neq$ impossible), or the hypothesis is wrong. Action: reject $H_0$.

Does this suggest that we should simply compute the probability of observing the data under the null hypothesis ($H_0$)?

Suppose we flip a coin 200 times and observe 100 heads and 100 tails. There seems to be no reason to doubt that this coin is fair, $P(H) = P(T) = .5$. However, under the model $X \sim \text{Binomial}(200, .5)$,

$$P(X = 100) = \binom{200}{100}(.5)^{100}(1-.5)^{100} \approx .056.$$

This is not a large probability in itself. The probability of getting exactly 500 heads from 1000 flips is only .025. What should we do? We certainly are not willing to conclude that the observation is incompatible with the hypothesis in this case.
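The point that the probability of the exact observed outcome is not a useful measure on its own can be seen numerically. The sketch below uses a small helper, `binom_pmf`, which is a name introduced here for illustration.

```python
from math import comb

def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

p_all_heads = binom_pmf(10, 10, 0.5)      # 10 heads in 10 flips
p_100_of_200 = binom_pmf(100, 200, 0.5)   # exactly 100 heads in 200 flips
p_500_of_1000 = binom_pmf(500, 1000, 0.5) # exactly 500 heads in 1000 flips

print(f"{p_all_heads:.5f} {p_100_of_200:.4f} {p_500_of_1000:.4f}")
```

Even the most likely single outcome becomes improbable as the number of flips grows, which is why the test looks at a set of outcomes rather than one value.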

We need an alternative hypothesis, which we denote by $H_1$ or $H_a$. For example, if the coin is not fair, the alternative could be that $p \neq .5$, but we do not know whether $p > .5$ or $p < .5$.

Under the model $H_0$, which values are as or more extreme than the one observed? By more extreme, we mean data that would make you lean towards the alternative more than the data you observed.

[Table: number of heads (0 to 10 out of 10 flips) and the corresponding probability under $H_0$.]

Since our alternative is $p \neq .5$, extreme observations are either a small number of heads or a large number of heads: values that are far away from $EX = 10 \times .5 = 5$. As extreme as the observation of 2 heads is 8 heads; more extreme than 2 heads is 0, 1, 9, or 10 heads.

Now we can ask the question: what is the probability of observing something as or more extreme than the actual data, under $H_0$?

$$P_0(X = 0 \text{ or } 1 \text{ or } 2 \text{ or } 8 \text{ or } 9 \text{ or } 10) = P_0(X=0) + P_0(X=1) + P_0(X=2) + P_0(X=8) + P_0(X=9) + P_0(X=10) = .11$$

(We use the subscript 0 in $P_0$ to indicate that the probability is calculated under the $H_0$ model.)
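The two-sided calculation can be sketched as follows, treating "as or more extreme" as "at least as far from the expected count of 5 as the observed count". The helper `binom_pmf` is a name introduced here for illustration.

```python
from math import comb

def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

n, p0, observed = 10, 0.5, 2
center = n * p0  # expected number of heads under H0

# Outcomes at least as far from the center as the observed value.
extreme = [k for k in range(n + 1) if abs(k - center) >= abs(observed - center)]
p_value = sum(binom_pmf(k, n, p0) for k in extreme)

print(extreme, round(p_value, 2))
```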

This probability says that, if many people do the same experiment of flipping a fair coin 10 times, about 11% of them will see values as or more extreme than 2 heads. If you don't consider 11% a very rare probability, then you may not be surprised when the observation is 2.

We have actually done hypothesis testing already! Given the observed data, 2 heads out of 10 flips of a coin:

We tested the hypothesis $H_0$ that the coin is fair against the alternative hypothesis $H_1$ that the coin is not fair.

We computed the probability of observing results as or more extreme than the data, under $H_0$. This probability is referred to as the p-value.

If the p-value is small, it means either that something improbable has happened or that $H_0$ is problematic. We reject $H_0$ when the p-value is small.

How small is small? Traditionally people have used .05 and .01. This number is called the significance level.

What if the alternative hypothesis is $p < .5$ instead of $p \neq .5$? In hypothesis testing, we think that either $H_0$ or $H_1$ is true. $H_1$ is used to determine which values are as or more extreme under $H_0$.

[Table: number of heads (0 to 10 out of 10 flips) and the corresponding probability under $H_0$.]

Which values, compared to the actual data $X = 2$, would make you lean more towards the alternative hypothesis $p < .5$? These would be $X = 0$ and $X = 1$. The values 8, 9, 10 are no longer extreme if the alternative is $p < .5$ instead of $p \neq .5$. Now the p-value becomes

$$P_0(X = 0 \text{ or } 1 \text{ or } 2) = P_0(X=0) + P_0(X=1) + P_0(X=2) \approx .055.$$

When we use $H_1: p \neq .5$, we call it a two-sided test. When we use $H_1: p < .5$ or $H_1: p > .5$, we call it a one-sided test.
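For the one-sided alternative $p < .5$, only the lower tail counts. A minimal sketch of that sum, again with an illustrative helper `binom_pmf`:

```python
from math import comb

def binom_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

n, p0, observed = 10, 0.5, 2

# With H1: p < .5, "as or more extreme" means observed or fewer heads.
p_value_lower = sum(binom_pmf(k, n, p0) for k in range(observed + 1))

print(round(p_value_lower, 4))
```

The one-sided p-value is exactly half of the two-sided one here because the Binomial(10, .5) distribution is symmetric about 5.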

Now let's get back to our original example: from a random sample of 10 people who are on a new diet, we observed their cholesterol levels. Can we test the hypothesis that people on the diet actually have the same mean cholesterol level as the general population?

$H_0: \mu = 188$

First, let's do a two-sided test: $H_1: \mu \neq 188$.

We start with the simplest case, as we did for confidence intervals. Let's assume that we know the standard deviation is 9.8. Under $H_0$,

$$\bar{X}_{10} \sim N(188,\ 9.8^2/10).$$

Now the question is, which values are as or more extreme?

$\bar{X}_{10} \sim N(188,\ 9.8^2/10)$. Can you compute the probability of observing a value as or more extreme than 180.8?

$$P(\bar{X}_{10} \le 180.8) = P\left(\frac{\bar{X}_{10} - 188}{\sqrt{9.8^2/10}} < \frac{180.8 - 188}{\sqrt{9.8^2/10}}\right) = P(Z < -2.32) = .01$$

The p-value is $2 \times .01 = .02 < .05$, so we would reject $H_0$ at significance level .05.
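The same z-test can be reproduced with the Python standard library. This is a sketch using the slide's numbers (null mean 188, known standard deviation 9.8, n = 10, observed mean 180.8); the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu0, sigma, n, xbar = 188.0, 9.8, 10, 180.8

# Z-statistic: standardized distance of the sample mean from the null mean.
z = (xbar - mu0) / (sigma / sqrt(n))

# Two-sided p-value: probability of a |Z| at least this large under H0.
p_two_sided = 2 * NormalDist().cdf(-abs(z))

print(f"z = {z:.2f}, two-sided p-value = {p_two_sided:.3f}")
```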

For the observed sample mean 180.8, we have rejected $H_0$ at significance level .05. What if the observation is 182? What about 183? Or, more generally, what are the values of $\bar{X}_{10}$ such that you would just reject $H_0$ at significance level .05?

What we know under $H_0$: the distribution of $\dfrac{\bar{X} - 188}{\sqrt{9.8^2/10}}$ is the standard normal $Z$.

If $\dfrac{\bar{X} - 188}{\sqrt{9.8^2/10}} < -1.96$ or $\dfrac{\bar{X} - 188}{\sqrt{9.8^2/10}} > 1.96$, we would reject $H_0$ at significance level .05.

If $\dfrac{\bar{X} - 188}{\sqrt{9.8^2/10}} < -z_{.01/2} = -2.58$ or $\dfrac{\bar{X} - 188}{\sqrt{9.8^2/10}} > z_{.01/2} = 2.58$, we would reject $H_0$ at significance level .01.


We call $\dfrac{\bar{X} - 188}{\sqrt{9.8^2/10}}$ the test statistic and the regions $(-\infty, -1.96)$, $(1.96, \infty)$ or $(-\infty, -2.58)$, $(2.58, \infty)$ critical regions. When the test statistic is inside the critical region, we reject $H_0$.

We say $\dfrac{\bar{X} - 188}{\sqrt{9.8^2/10}}$ is a Z-statistic since it follows a standard normal distribution under the $H_0$ model.

In our example, $\bar{X}_{10} = 180.8$, so

$$\frac{\bar{X} - 188}{\sqrt{9.8^2/10}} = \frac{180.8 - 188}{\sqrt{9.8^2/10}} = -2.32$$

is within the region $(-\infty, -1.96)$, but not within the regions $(-\infty, -2.58)$ or $(2.58, \infty)$. Thus we reject $H_0$ at the .05 level, but not at the .01 level.

What if we have a one-sided alternative? $H_0: \mu = 188$, $H_1: \mu < 188$. Now which values are as or more extreme?

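$$\frac{\bar{X} - 188}{\sqrt{9.8^2/10}} = \frac{180.8 - 188}{\sqrt{9.8^2/10}} = -2.32$$

p-value: $P(Z < -2.32) = .01$

Critical value for $\alpha = .05$: $P(Z < q) = .05 \Rightarrow q = -1.64$

Critical value for $\alpha = .01$: $P(Z < q) = .01 \Rightarrow q = -2.32$

For the one-sided test, we would reject $H_0$ at both significance levels .05 and .01 (at .01 the statistic sits right at the cutoff, so the decision there depends on rounding).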
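The one-sided critical values can be obtained from the standard normal quantile function instead of a Z-table. The sketch below assumes the lower-tail alternative $H_1: \mu < 188$; unrounded quantiles differ slightly from two-decimal table values.

```python
from math import sqrt
from statistics import NormalDist

Z = NormalDist()

# Lower-tail critical values q such that P(Z < q) = alpha.
q_05 = Z.inv_cdf(0.05)
q_01 = Z.inv_cdf(0.01)

# Observed Z-statistic from the cholesterol example.
z = (180.8 - 188.0) / (9.8 / sqrt(10))

print(f"q(.05) = {q_05:.3f}, q(.01) = {q_01:.3f}, z = {z:.3f}")
print("reject at .05:", z < q_05)
```

Printing three decimals makes it visible that the statistic and the .01 cutoff agree to about two decimal places, so a conclusion at that level is sensitive to rounding.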

Summary of the steps in hypothesis testing A:

In general
1. Select the probability model
2. Set up the null and alternative hypotheses
3. Determine a test statistic
4. Determine the significance level and critical region
5. Reject $H_0$ if the test statistic is in the critical region

In the previous example
1. Use a normal probability model with known variance
2. $H_0: \mu = 188$, $H_1: \mu < 188$
3. Z-statistic $\dfrac{\bar{X} - \mu}{\sigma/\sqrt{n}} = -2.32$
4. $\alpha = .05$, critical region $(-\infty, -1.64)$
5. The test statistic is in the critical region, reject $H_0$

Summary of the steps in hypothesis testing B:

In general
1. Select the probability model
2. Set up the null and alternative hypotheses
3. Determine a test statistic
4. Compute the p-value
5. Reject $H_0$ if the p-value is less than the significance level

In the previous example
1. Use a normal probability model with known variance
2. $H_0: \mu = 188$, $H_1: \mu < 188$
3. Z-statistic $\dfrac{\bar{X} - \mu}{\sigma/\sqrt{n}} = -2.32$
4. $P(Z < -2.32) = .01$
5. $.01 < \alpha = .05$, reject $H_0$

Now, what if we do not know the standard deviation? Can we still do hypothesis testing?

1. Use a normal probability model with unknown variance
2. $H_0: \mu = 188$, $H_1: \mu < 188$

We can no longer form the Z-statistic. But we can estimate $\sigma^2$ and form a T statistic:

$$T = \frac{\bar{X} - \mu}{S/\sqrt{n}} \sim t_{df=n-1}$$

From our data we have $S^2 = 93$, thus

$$T = \frac{\bar{X} - \mu}{S/\sqrt{n}} = \frac{180.8 - 188}{\sqrt{93/10}} = -2.36.$$

Method A: find the critical region from the t-distribution (df = 9). $t_{.05,\,df=9} = 1.83$, thus we have the critical region $(-\infty, -1.83)$. The test statistic is in the critical region; reject $H_0$ at significance level .05.

Method B: the p-value is $P(T < -2.36) = .02 < .05$; reject $H_0$ at significance level .05.
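Method A can be sketched with plain arithmetic. The critical value 1.83 is taken from the slide (a t-table lookup for df = 9) rather than computed, since the standard library has no t-distribution quantile function.

```python
from math import sqrt

mu0, n, xbar, s2 = 188.0, 10, 180.8, 93.0

# T-statistic: like the Z-statistic, but with the sample variance S^2.
t_stat = (xbar - mu0) / sqrt(s2 / n)

# t_{.05, df=9} from the slide's t-table lookup.
t_crit = 1.83
reject = t_stat < -t_crit  # lower-tail critical region (-inf, -1.83)

print(f"T = {t_stat:.2f}, reject H0 at .05: {reject}")
```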

Hypothesis testing for comparing two means: I

$X_1, \ldots, X_{n_1} \sim N(\mu_X, \sigma_X^2)$ ($\sigma_X^2$ known)

$Y_1, \ldots, Y_{n_2} \sim N(\mu_Y, \sigma_Y^2)$ ($\sigma_Y^2$ known)

$H_0: \mu_X = \mu_Y$, or $\mu_X - \mu_Y = 0$

$$T = \frac{\bar{X} - \bar{Y} - (\mu_X - \mu_Y)}{\sqrt{\sigma_X^2/n_1 + \sigma_Y^2/n_2}} \sim N(0, 1) \text{ under } H_0$$

Example: we had the cholesterol data from the last lecture on confidence intervals.

[Data: the X sample ($n_1 = 10$) and the Y sample ($n_2 = 20$).]

$\bar{X} = 180.8$, $\bar{Y} = 199$. Suppose we know $\sigma_X^2 = \sigma_Y^2 = 9.8^2$.

$$T = \frac{180.8 - 199}{\sqrt{9.8^2/10 + 9.8^2/20}} = -4.79$$

One-sided critical region for $\alpha = .05$: $(-\infty, -1.64)$

One-sided critical region for $\alpha = .01$: $(-\infty, -2.32)$

Reject $H_0$. Or, compute the p-value $P(Z < -4.79) \approx 0$; reject $H_0$.
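A sketch of the two-sample Z-statistic for this example follows. The sample sizes 10 and 20 appear in the slide's formula; the common known standard deviation of 9.8 is an assumption here, chosen because it is the value used earlier in the lecture and it reproduces the reported statistic of about -4.79.

```python
from math import sqrt
from statistics import NormalDist

xbar, ybar = 180.8, 199.0
n1, n2 = 10, 20
sigma2 = 9.8 ** 2  # assumed common known variance (see note above)

# Under H0 the difference in population means is 0.
t_stat = (xbar - ybar) / sqrt(sigma2 / n1 + sigma2 / n2)
p_lower = NormalDist().cdf(t_stat)  # one-sided (lower-tail) p-value

print(f"T = {t_stat:.2f}, p-value = {p_lower:.1e}")
```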

Hypothesis testing for comparing two means: II

$X_1, \ldots, X_{n_1} \sim N(\mu_X, \sigma_X^2)$

$Y_1, \ldots, Y_{n_2} \sim N(\mu_Y, \sigma_Y^2)$

($\sigma_X^2$, $\sigma_Y^2$ unknown but equal)

$$T = \frac{\bar{X} - \bar{Y} - (\mu_X - \mu_Y)}{\sqrt{S_p^2/n_1 + S_p^2/n_2}} \sim t_{df=n_1+n_2-2} \text{ under } H_0$$

Estimate the common variance by the pooled sample variance $S_p^2$ (if you forgot how this is done, review lecture 13). Form the test statistic

$$T = \frac{180.8 - 199}{\sqrt{S_p^2\,(1/10 + 1/20)}} = -4.69$$

One-sided critical region for $\alpha = .05$: $(-\infty, -t_{.05,\,df=28}) = (-\infty, -1.70)$

One-sided critical region for $\alpha = .01$: $(-\infty, -t_{.01,\,df=28}) = (-\infty, -2.47)$

Reject $H_0$. (Exercise: what would the critical regions be if we were doing two-sided tests?)

Or compute the p-value $P(t_{df=28} < -4.69) \approx 0$; reject $H_0$.

Hypothesis testing for comparing two means: III

$X_1, \ldots, X_{n_1} \sim N(\mu_X, \sigma_X^2)$

$Y_1, \ldots, Y_{n_2} \sim N(\mu_Y, \sigma_Y^2)$

($\sigma_X^2$, $\sigma_Y^2$ unknown and unequal)

$$T = \frac{\bar{X} - \bar{Y} - (\mu_X - \mu_Y)}{\sqrt{S_X^2/n_1 + S_Y^2/n_2}} \sim \text{Welch } t \text{ under } H_0$$

As we learned in previous lectures, the degrees of freedom for this distribution are not simple. For large samples we know it converges to $N(0, 1)$; for small samples we can be conservative and use $df = \min(n_1 - 1, n_2 - 1)$ if there is no computer available.

What happens when the two populations are not independent? What happens when we do not start with normal distributions?

Difference of means of paired observations

$X \sim N(\mu_X, \sigma_X^2)$, $Y \sim N(\mu_Y, \sigma_Y^2)$, where $X_k$ is paired with $Y_k$ (before/after, left-hand/right-hand, paired treated/control, ...)

$H_0: \mu_X = \mu_Y$, $H_1: \mu_X \neq \mu_Y$

But since $X$ and $Y$ are not independent, $\bar{X}$ and $\bar{Y}$ are not independent. We do not have the simple result $\bar{X} - \bar{Y} \sim N(\mu_X - \mu_Y,\ \sigma_X^2/n + \sigma_Y^2/n)$ (why not?)

Solution: for each pair, we form the difference $D_k = X_k - Y_k$. Thus $D_1, \ldots, D_n \sim N(\mu_X - \mu_Y, \sigma_D^2)$, and we estimate $\sigma_D^2$ with the sample variance $S_D^2$.

The test statistic for testing that the mean difference equals $d_0$ is then

$$\frac{\bar{D} - d_0}{S_D/\sqrt{n}},$$

which follows a Student t distribution with $n - 1$ degrees of freedom. The most common test is for a mean difference of 0.

Example: Suppose you wish to test the effect of Prozac on the well-being of depressed individuals, using a standardised well-being scale. Higher scores indicate greater well-being (that is, Prozac is having a positive effect). We assume that the scores are approximately normally distributed.

[Table: ID, Pre score, Post score for each individual.]

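[Table: ID, Pre, Post, and difference (post - pre) for each of the 9 individuals.]

$\bar{d} = 3.67$, with sample variance $S_d^2$.

$H_0$: mean difference $= 0$, $H_1$: mean difference $\neq 0$

$$t = \frac{\bar{D} - 0}{S_d/\sqrt{9}}, \quad df = 8$$

Rejection region at $\alpha = .05$: $(2.306, \infty)$ and $(-\infty, -2.306)$

Rejection region at $\alpha = .01$: $(3.355, \infty)$ and $(-\infty, -3.355)$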

When we do not start with normal distributions, the Central Limit Theorem is our friend again, as long as we have large samples.

Example: Bernoulli

Suppose the national incidence rate of disease W for children at age 5 is .0137 (137 per 10,000) in a given year. We want to know if the incidence rate in Providence is the same as the national rate. A sample of 2000 children was randomly selected and their medical records queried to see if they had caught the disease in that year. 30 of them had the disease.

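1. Select a probability model: Bernoulli($p$)
2. $H_0: p = .0137$, $H_1: p \neq .0137$
3. Determine a test statistic: 2000 is a large sample, so by the CLT, approximately $\bar{X} \sim N(p,\ p(1-p)/n)$, thus $\dfrac{\bar{X} - p}{\sqrt{p(1-p)/n}}$ is approximately $N(0, 1)$
4. Under $H_0$, $\dfrac{\bar{X} - .0137}{\sqrt{.0137(1 - .0137)/2000}}$ is approximately $N(0, 1)$

We observe $\bar{X} = 30/2000 = .015$, thus the test statistic is

$$\frac{.015 - .0137}{\sqrt{.0137(1 - .0137)/2000}} \approx .5$$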

For a two-sided test, the critical regions for $\alpha = .05$ are $(-\infty, -1.96)$ and $(1.96, \infty)$. We do not reject $H_0$ at significance level .05 (thus we certainly do not reject $H_0$ at any more stringent level, such as .01).

Or, we compute the p-value $2P(Z > .519) = 2 \times .30 = .60 > .05$, and we do not reject $H_0$ at significance level .05.

Example: Suppose we want to compare the average number of daily visits for two emergency rooms, A and B. For each we record the daily number of visits for a year. On average, there are 15.4 visits a day to ER A and 14.8 visits a day to ER B. Do these two ERs have the same daily visit rates?

1. Probability model: Poisson, for events that happen randomly over time
2. $H_0: \lambda_A = \lambda_B$, $H_1: \lambda_A \neq \lambda_B$
3. Test statistic: for either ER we observe 365 days, so by the CLT, approximately

$$\bar{X}_A \sim N(\lambda_A, \lambda_A/n), \quad \bar{X}_B \sim N(\lambda_B, \lambda_B/n), \quad \bar{X}_A - \bar{X}_B \sim N(\lambda_A - \lambda_B,\ \lambda_A/n + \lambda_B/n)$$

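$$\frac{(\bar{X}_A - \bar{X}_B) - (\lambda_A - \lambda_B)}{\sqrt{\lambda_A/n + \lambda_B/n}} \text{ is approximately } N(0, 1)$$

4. Under $H_0$, $\lambda_A - \lambda_B = 0$ and the two rates share a common value $\lambda$, so

$$\frac{\bar{X}_A - \bar{X}_B}{\sqrt{2\lambda/n}} \text{ is approximately } N(0, 1)$$

We can pool the two samples to estimate $\lambda$ and get

$$\hat{\lambda} = (15.4 \times 365 + 14.8 \times 365)/(365 + 365) = 15.1$$

Test statistic:

$$\frac{15.4 - 14.8}{\sqrt{2 \times 15.1/365}} = 2.09 > Z_{.025} = 1.96$$

Or, p-value $= 2P(Z > 2.09) = 2 \times .018 = .036 < .05$. Reject $H_0$.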
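The Poisson comparison can be sketched as below, using the daily averages and the 365 observation days from the example. Variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mean_a, mean_b, n = 15.4, 14.8, 365

# Pooled estimate of the common rate under H0 (equal sample sizes).
lam_hat = (mean_a * n + mean_b * n) / (n + n)

# For a Poisson count the variance equals the mean, hence 2 * lambda / n.
z = (mean_a - mean_b) / sqrt(2 * lam_hat / n)
p_value = 2 * (1 - NormalDist().cdf(abs(z)))

print(f"lambda_hat = {lam_hat:.1f}, z = {z:.2f}, p-value = {p_value:.3f}")
```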

Review: The logic for hypothesis testing

If the null hypothesis, instead of the alternative hypothesis, is true, should I be surprised by the data?

I am surprised if the probability of observing such a result, or a more extreme one, is small under $H_0$. This probability is called the p-value.

By convention, most people reject the null hypothesis if the p-value is smaller than .05.

The smaller the p-value, the stronger my doubt about $H_0$, and thus the more significant the result is against $H_0$.

Review: The procedure for hypothesis testing

1. Select the probability model
2. Set up the null and alternative hypotheses
3. Determine a test statistic
4. Compute the p-value
5. Reject $H_0$ if the p-value is less than the significance level

Possible results from hypothesis testing:

                 Decision: Reject H0       Decision: Not reject H0
    H0 true      Type I error (α)          correct
    H1 true      correct                   Type II error (β)

Type I error: rejecting $H_0$ when $H_0$ is true.

Type II error: failing to reject $H_0$ when $H_0$ is false ($H_1$ is true).

Power: the ability to reject $H_0$ when $H_0$ is false.

$P_0(\text{Reject } H_0) = P(\text{Reject } H_0 \mid H_0 \text{ true}) = \alpha$; this probability is the significance level (type I error rate).

$P_1(\text{Reject } H_0) = P(\text{Reject } H_0 \mid H_1 \text{ true}) = 1 - \beta$; this probability is the power of a hypothesis test.

Consider a one-sided test first: $H_0: \mu = \mu_0 = 5$, $H_1: \mu = \mu_1 = 8 > \mu_0$.

Suppose we have a normal model and know the variance is 100. For a sample size of 100, we form the test statistic

$$T = \frac{\bar{X} - 5}{10/\sqrt{100}} = \bar{X} - 5.$$

Under $H_0$, we know $T$ follows the $Z$ distribution.

[Figure: standard normal density under $H_0$, with the critical region (reject $H_0$) shaded to the right of the cutoff $c$.]

For any decision rule "reject $H_0$ if the test statistic is greater than $c$", the type I error is the area of the red shaded region.

Demo 1: type I error and choice of critical region

What about type II error?

Under $H_1$, $\dfrac{\bar{X} - 8}{10/\sqrt{100}} = \bar{X} - 8 \sim N(0, 1)$.

Under $H_1$, $\bar{X} - 5 = (\bar{X} - 8) + 3 \sim N(3, 1)$.

[Figure: densities of the test statistic under $H_0$ and $H_1$, showing the critical region (reject $H_0$), $\alpha$, and $\beta$.]

We can try to reduce the type I error by using a larger cutoff, but this would increase the type II error (reducing power). We can try to increase power by using a smaller cutoff, but this would increase the type I error.
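The trade-off can be made concrete for this example, where the test statistic is $N(0, 1)$ under $H_0$ and $N(3, 1)$ under $H_1$. The function `error_rates` and the list of cutoffs are illustrative choices, not part of the lecture.

```python
from statistics import NormalDist

Z = NormalDist()

def error_rates(c, shift=3.0):
    """Type I and type II error for the rule 'reject H0 if T > c'.

    Under H0, T ~ N(0, 1); under H1, T ~ N(shift, 1).
    """
    alpha = 1 - Z.cdf(c)       # P(T > c | H0)
    beta = Z.cdf(c - shift)    # P(T <= c | H1)
    return alpha, beta

cutoffs = [1.0, 1.645, 2.326]
rates = [error_rates(c) for c in cutoffs]
for c, (alpha, beta) in zip(cutoffs, rates):
    print(f"c = {c:.3f}: alpha = {alpha:.3f}, beta = {beta:.3f}, power = {1 - beta:.3f}")
```

Raising the cutoff lowers alpha and raises beta in every row, which is the trade-off described above.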

Demo: the trade-off between type I and type II error.

Two-sided test:

[Figure: density of the test statistic with two critical regions (reject $H_0$), one in each tail.]

We have seen that for a hypothesis test there is a trade-off between type I and type II errors. For the same study design, we cannot simultaneously reduce both of them.

The common practice is to fix the type I error at a small level, such as .05 or .01, so that we know that at least we are not rejecting $H_0$ too often when we should not.

What other factors affect power, for a given type I error?

(1. Type I error)

2. Effect size

If $H_1$ is true, the larger the difference between $\mu_0$ (null value) and $\mu_1$ (alternative), the higher the power.

[Figure: density under $H_0$ and two alternative densities under $H_1$, showing the critical region (reject $H_0$), $\alpha$, and $\beta$.]

3. Sample size

We know that, approximately, $\bar{X} \sim N(\mu, \sigma^2/n)$. This means that as the sample size increases, the sample mean gets more concentrated near the true mean. Thus the null and alternative hypotheses become easier to separate.

[Figure: densities under $H_0$ and $H_1$ for a smaller and a larger sample size.]

Demo 3

Computation of power

1. Write down the two hypotheses $H_0$ and $H_1$
2. Write down the probability models based on each hypothesis
3. Determine the test statistic
4. For a given type I error rate ($\alpha$), effect size and sample size, determine the critical region (rejection region)
5. Compute the power $1 - \beta$

Computation of power: one-sided test

Example: Suppose we want to test whether the mean of a population is 12 or less than 12. We assume a normal distribution with known variance 36. What is the power of this test if the true mean is 10, we have a sample size of 25, and we set the significance level to .05?

1. $H_0: \mu = 12$; $H_1: \mu < 12$
2. Under $H_0$: $X \sim N(12, 36)$, $\bar{X} \sim N(12, 36/n)$. Truth: $X \sim N(10, 36)$, $\bar{X} \sim N(10, 36/n)$
3. Test statistic: $\dfrac{\bar{X} - 12}{\sqrt{36/25}}$
4. Since we have a one-sided test with $H_1: \mu < 12$, we will reject $H_0$ when the test statistic is less than a cutoff:
$$\alpha = .05 = P_0(\text{Reject } H_0) = P_0\left(\frac{\bar{X} - 12}{\sqrt{36/25}} < C\right) = P(Z < C)$$
From the Z-table we know $C = -1.645$.
5. Under $H_1$,

$$\begin{aligned}
\text{POWER} &= P_1\left(\frac{\bar{X} - 12}{\sqrt{36/25}} < -1.645\right) \\
&= P_1\left(\frac{\bar{X} - 12}{6/5} < -1.645\right) \\
&= P_1\left(\frac{\bar{X} - 10}{6/5} + \frac{10 - 12}{6/5} < -1.645\right) \\
&= P_1\left(\frac{\bar{X} - 10}{6/5} < -1.645 + \frac{2}{6/5}\right) \\
&= P(Z < .0217) = 1 - P(Z > .0217) \approx .51
\end{aligned}$$

Example: Suppose we want to test whether the mean of a population is 12 or greater than 12. We assume a normal distribution with known variance 36. What is the power of this test if the true mean is 10, we have a sample size of 25, and we set the significance level to .05?

1. $H_0: \mu = 12$; $H_1: \mu > 12$
2. Under $H_0$: $X \sim N(12, 36)$, $\bar{X} \sim N(12, 36/n)$. Truth: $X \sim N(10, 36)$, $\bar{X} \sim N(10, 36/n)$
3. Test statistic: $\dfrac{\bar{X} - 12}{\sqrt{36/25}}$
4. Since we have a one-sided test with $H_1: \mu > 12$, we will reject $H_0$ when the test statistic is greater than a cutoff:
$$\alpha = .05 = P_0(\text{Reject } H_0) = P_0\left(\frac{\bar{X} - 12}{\sqrt{36/25}} > C\right) = P(Z > C)$$
From the Z-table we know $C = 1.645$.
5. Under $H_1$,

$$\begin{aligned}
\text{POWER} &= P_1\left(\frac{\bar{X} - 12}{\sqrt{36/25}} > 1.645\right) \\
&= P_1\left(\frac{\bar{X} - 12}{6/5} > 1.645\right) \\
&= P_1\left(\frac{\bar{X} - 10}{6/5} + \frac{10 - 12}{6/5} > 1.645\right) \\
&= P_1\left(\frac{\bar{X} - 10}{6/5} > 1.645 + \frac{2}{6/5}\right) \\
&= P(Z > 3.31) \approx 0
\end{aligned}$$

When the truth is $\mu = 10$, the probability that you will be able to reject $H_0$ in a test of $\mu = 12$ versus $\mu > 12$ is nearly 0.

Example: Suppose we want to test whether the mean of a population is 12 or not 12. We assume a normal distribution with known variance 36. What is the power of this test if the true mean is 10, we have a sample size of 25, and we set the significance level to .05?

1. $H_0: \mu = 12$; $H_1: \mu \neq 12$
2. Under $H_0$: $X \sim N(12, 36)$, $\bar{X} \sim N(12, 36/n)$. Truth: $X \sim N(10, 36)$, $\bar{X} \sim N(10, 36/n)$
3. Test statistic: $\dfrac{\bar{X} - 12}{\sqrt{36/25}}$
4. Since we have a two-sided test, we will reject $H_0$ when the absolute value of the test statistic is greater than a cutoff:
$$\alpha = .05 = P_0(\text{Reject } H_0) = P_0\left(\left|\frac{\bar{X} - 12}{\sqrt{36/25}}\right| > C\right) = P(|Z| > C)$$
From the Z-table we know $C = Z_{.025} = 1.96$.
5. Under $H_1$,

$$\begin{aligned}
\text{POWER} &= P_1\left(\left|\frac{\bar{X} - 12}{\sqrt{36/25}}\right| > 1.96\right) \\
&= P_1\left(\frac{\bar{X} - 12}{6/5} > 1.96\right) + P_1\left(\frac{\bar{X} - 12}{6/5} < -1.96\right) \\
&= P_1\left(\frac{\bar{X} - 10}{6/5} + \frac{10 - 12}{6/5} > 1.96\right) + P_1\left(\frac{\bar{X} - 10}{6/5} + \frac{10 - 12}{6/5} < -1.96\right) \\
&= P\left(Z > 1.96 + \frac{2}{6/5}\right) + P\left(Z < -1.96 + \frac{2}{6/5}\right) \\
&= P(Z > 3.63) + P(Z < -.29) \\
&= P(Z > 3.63) + P(Z > .29) \approx .39
\end{aligned}$$

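In general, for $X_1, X_2, \ldots, X_n \sim N(\mu, \sigma^2)$ with true mean $\mu_1$:

The power for the test $H_0: \mu = \mu_0$ versus $H_1: \mu < \mu_0$ is

$$P\left(Z < \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} - Z_\alpha\right)$$

The power for the test $H_0: \mu = \mu_0$ versus $H_1: \mu > \mu_0$ is

$$P\left(Z > \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} + Z_\alpha\right)$$

The power for the test $H_0: \mu = \mu_0$ versus $H_1: \mu \neq \mu_0$ is

$$P\left(Z < \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} - Z_{\alpha/2}\right) + P\left(Z > \frac{\mu_0 - \mu_1}{\sigma/\sqrt{n}} + Z_{\alpha/2}\right)$$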
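The three formulas translate directly into a small function. This is a sketch: `power_z_test` and its argument names are introduced here for illustration, and the calls reuse the numbers from the worked examples (null mean 12, true mean 10, standard deviation 6, n = 25, significance level .05).

```python
from math import sqrt
from statistics import NormalDist

Z = NormalDist()

def power_z_test(mu0, mu1, sigma, n, alpha, alternative):
    """Power of a Z-test of H0: mu = mu0 when the true mean is mu1."""
    shift = (mu0 - mu1) / (sigma / sqrt(n))
    if alternative == "less":        # H1: mu < mu0
        return Z.cdf(shift - Z.inv_cdf(1 - alpha))
    if alternative == "greater":     # H1: mu > mu0
        return 1 - Z.cdf(shift + Z.inv_cdf(1 - alpha))
    if alternative == "two-sided":   # H1: mu != mu0
        z_half = Z.inv_cdf(1 - alpha / 2)
        return Z.cdf(shift - z_half) + (1 - Z.cdf(shift + z_half))
    raise ValueError("alternative must be 'less', 'greater' or 'two-sided'")

for alt in ("less", "greater", "two-sided"):
    print(alt, round(power_z_test(12, 10, 6, 25, 0.05, alt), 4))
```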

Exercise: Now suppose I give you the same setup with different numbers: a different $H_0$ mean (not 12, but 200), a different truth (not 10, but 180), a different variance (not 36, but 64), a different significance level (not .05, but .01), and a different sample size (not 25, but 49). Can you compute the power for the test $H_0: \mu = 200$ versus $H_1: \mu > 200$?

Next topic: We now know that type I error, type II error (1 - power), effect size and sample size are all connected. Can we determine a necessary sample size when we need to meet certain requirements on the error rates?

For a one-sample two-sided test, $H_0: \mu = \mu_0$ versus $H_1: \mu \neq \mu_0$:

$$n = \frac{(Z_{\alpha/2} + Z_\beta)^2 \sigma^2}{(\mu_1 - \mu_0)^2}$$

For a one-sample one-sided test, $H_0: \mu = \mu_0$ versus $H_1: \mu < \mu_0$ (or $H_1: \mu > \mu_0$):

$$n = \frac{(Z_\alpha + Z_\beta)^2 \sigma^2}{(\mu_1 - \mu_0)^2}$$

The smaller the error rates, the larger the sample size.
The larger the variance, the larger the sample size.
The larger the effect size, the smaller the sample size.
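The sample size formulas can be sketched as a function that rounds up to the next whole observation. The function name `sample_size` and the target power of .80 in the usage lines are assumptions for illustration; the other numbers reuse the earlier power example (null mean 12, alternative mean 10, standard deviation 6).

```python
from math import ceil
from statistics import NormalDist

def sample_size(mu0, mu1, sigma, alpha, power, two_sided=True):
    """Smallest n meeting the type I error rate alpha and the target power."""
    Z = NormalDist()
    z_alpha = Z.inv_cdf(1 - (alpha / 2 if two_sided else alpha))
    z_beta = Z.inv_cdf(power)  # Z_beta, where beta = 1 - power
    return ceil((z_alpha + z_beta) ** 2 * sigma ** 2 / (mu1 - mu0) ** 2)

n_two_sided = sample_size(12, 10, 6, alpha=0.05, power=0.80)
n_one_sided = sample_size(12, 10, 6, alpha=0.05, power=0.80, two_sided=False)
print(n_two_sided, n_one_sided)
```

The one-sided design needs fewer observations because its critical value $Z_\alpha$ is smaller than $Z_{\alpha/2}$.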


More information

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Lecture 10: Comparing two populations: proportions

Lecture 10: Comparing two populations: proportions Lecture 10: Comparing two populations: proportions Problem: Compare two sets of sample data: e.g. is the proportion of As in this semester 152 the same as last Fall? Methods: Extend the methods introduced

More information

OHSU OGI Class ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Brainerd Basic Statistics Sample size?

OHSU OGI Class ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Brainerd Basic Statistics Sample size? ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Basic Statistics Sample size? Sample size determination: text section 2-4-2 Page 41 section 3-7 Page 107 Website::http://www.stat.uiowa.edu/~rlenth/Power/

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

Introduction to Statistical Inference

Introduction to Statistical Inference Introduction to Statistical Inference Dr. Fatima Sanchez-Cabo f.sanchezcabo@tugraz.at http://www.genome.tugraz.at Institute for Genomics and Bioinformatics, Graz University of Technology, Austria Introduction

More information

Two Sample Problems. Two sample problems

Two Sample Problems. Two sample problems Two Sample Problems Two sample problems The goal of inference is to compare the responses in two groups. Each group is a sample from a different population. The responses in each group are independent

More information

Evaluating Hypotheses

Evaluating Hypotheses Evaluating Hypotheses IEEE Expert, October 1996 1 Evaluating Hypotheses Sample error, true error Confidence intervals for observed hypothesis error Estimators Binomial distribution, Normal distribution,

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

Introduction to Business Statistics QM 220 Chapter 12

Introduction to Business Statistics QM 220 Chapter 12 Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,

More information

TOPIC 12: RANDOM VARIABLES AND THEIR DISTRIBUTIONS

TOPIC 12: RANDOM VARIABLES AND THEIR DISTRIBUTIONS TOPIC : RANDOM VARIABLES AND THEIR DISTRIBUTIONS In the last section we compared the length of the longest run in the data for various players to our expectations for the longest run in data generated

More information

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in

More information

