BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings
|
|
- Irma Foster
- 5 years ago
- Views:
Transcription
1 BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall /30
2 Previous Two types of statistical inferences: Estimation: concerned with estimating the values of specific population parameters. These specific values are referred to as point estimates. Sometimes, interval estimation is carried out to specify an interval which likely includes the parameter values. Hypothesis testing: concerned with testing whether the value of a population parameter is equal to some specific value Yujin Chung Lec6: Statistical hypothesis testings Fall /30
3 Hypothesis testing Philosophy: prove a claim by contradiction. Analogy: dependent love story Claim : You don t love me. Reasoning : If you loved me, you would take the trash out every week and put your socks away. Data : Some weeks you don t take the trash out or leave your socks where they fall. Conclusion : You don t love me. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
4 Statistical hypothesis testing Hypothesis-testing framework specifies two hypotheses: null and alternative hypothesis The null hypothesis (H0) is often an initial claim that researchers specify using previous research or knowledge. Typically it is a statement that the value of a population parameter (such as proportion, mean, or standard deviation) is equal to some claimed value. The alternative hypothesis (H1) is what you might believe to be true or hope to prove true. H0 : you love me vs. H1 : you don t love me Hypothesis-testing provides an objective framework for making decisions using probabilities methods, rather than relying on subjective impressions. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
5 Examples The average of cholesterol level in children is 175mg/dL. A group of men who have died from heart disease within the past year are identified, and the cholesterol of their offspring are measured. (1) Is the average cholesterol level of these children larger than 175mg/dL? (2) Is the average cholesterol level different from that of children whose fathers do not have a history of heart disease? µ 1 : the population mean of cholesterol level in the case group µ 2 : the poulation mean of cholesterol level in the control group (1) H0: µ1 = 175 vs. H1: µ 1 > 175 (2) H0: µ1 = µ 2 vs. H1: µ 1 µ 2 Are the IQ and the number of finger-wrist taps (fwt) of children in the lead exposed group different from those of children in the control group? H0: the population mean of fwt in the two groups are the same vs. H1: the means are different Yujin Chung Lec6: Statistical hypothesis testings Fall /30
6 Four possible outcomes in hypothesis testing No reject H0 Reject H0 H0 true true negative false positive (1 α) Type I error (α) H1 true false negative true positive Type II error (β) Power (1 β) Two possible errors type I error (α): Pr(Reject H0 H0 true). commonly referred to as the significance level of a test. type II error (β): Pr(Not reject H0 H1 true) The power of a test: 1 β = Pr(Reject H0 H1 true) We prefer a test with small α and large power (1 β). Statistical hypothesis test: the greatest power (1 β) among all possible tests of a given type I error α Yujin Chung Lec6: Statistical hypothesis testings Fall /30
7 t-test for the Mean We assume the cholesterol levels in children follow N(µ, σ 2 ). We wish to test whether the cholesterol levels of children with family history is same as 175mg/dL, the average cholesterol without family or larger than 175. Hypotheses H 0 : µ = 175 vs H 1 : µ > 175. Logic: 1 Assume H 0 is true 2 If x is too large, it is a contradiction to the assumption that H 0 is true. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
8 t-test for the mean: critical-value method H 0 : µ = 175 vs H 1 : µ > 175. The distribution of X under H0 Since X 1,..., X n N(175, σ 2 ), t = X 175 S/ n t n 1. Critical value method: H0:µ=175 vs. H1:µ>175 Density function of t n Acceptance region Rejection region 4 0 t n 11 α 4 Critical value: t n 1,1 α If t > t n 1,1 α, reject H 0 ; if t t n 1,α, not reject H 0 Type I error: Pr(t > t n 1,1 α H0) = α; Yujin Chung Lec6: Statistical hypothesis testings Fall /30
9 t-test for the Mean: p-value method H 0 : µ = 175 vs H 1 : µ > 175. Test statistic: t = X 175 S/ n. Under H 0 : test statistic t t n 1 p-value: Pr(t > t(obs) H 0 ), the probability of obtaining a test statistic as extreme as or more extreme than the actual test statistic value, given that H 0 is true. p value for the test: H0:µ=175 vs. H1:µ>175 Density function of t n Rejection region p value: Pr(t>t(obs)) 4 0 t n 11 α t(obs) 4 If p-value < α, reject H 0 ; if p-value α, not reject H 0. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
10 Significance level: α H 0 is rejected if t > t n 1,1 α or p-value< α. α: significance level, type-i error, typically set to 0.05 Guidelines for judging the significance of a p-value If p 0.05, then the results are considered not statistically significant If 0.01 p < 0.05, then the results are statistically significant If p < 0.01, then the results are highly significant If p < 0.001, then the results are very highly significant Report an exact p-value! The p-value indicates exactly how significant the results are without performing repeated significance tests at different α levels. The p-value indicate how close to statistical significance the results have come even when they are not statistically significant Yujin Chung Lec6: Statistical hypothesis testings Fall /30
11 Example: the cholesterol of children Suppose the mean cholesterol level of 10 children whose fathers died from heart disease is 200 mg/dl and the sample standard deviation is 50 mg/dl. The average of cholesterol level in children is known as 175mg/dL. Is the average cholesterol level of these children larger than 175mg/dL? Let µ be the population mean cholesterol level of children whose fathers died from heart disease. The hypotheses are H 0 : µ = 175 vs. H 1 : µ > 175. The test statistic is t = x µ 0 s/ n and follows t n 1 under H 0. The observed test statistic is / n = Critical-value method At the significance level 5%, the critical value is t n 1,1 α = t 9,0.95 = and the rejection region is t > Since t(obs) = 1.58 < 1.833, we cannot reject H 0 at significance level 5% Yujin Chung Lec6: Statistical hypothesis testings Fall /30
12 Example: the cholesterol of children Suppose the mean cholesterol level of 10 children whose fathers died from heart disease is 200 mg/dl and the sample standard deviation is 50 mg/dl. The average of cholesterol level in children is known as 175mg/dL. Is the average cholesterol level of these children larger than 175mg/dL? Let µ be the population mean cholesterol level of children whose fathers died from heart disease. The hypotheses are H 0 : µ = 175 vs. H 1 : µ > 175. The test statistic is t = x µ0 s/ n and follows t n 1 = t 9 under H 0. The observed test statistic is / n = p-value method The p-value is p = Pr(t > t(obs) H 0 ) = Pr(t > 1.58 H 0 ) = Since p > 0.05, we cannot reject H 0 at significance level 5% Yujin Chung Lec6: Statistical hypothesis testings Fall /30
13 One-tailed t-test for the mean A one-tailed test is a test in which the values of the parameter being studied under the alternative hypothesis are allowed to be either greater than or less than the values of the parameter under the null hypothesis (µ 0 ) but not both. H 0 : µ = µ 0 test statistic: t = X µ 0 S/ n H 1 rejection region p-value µ > µ 0 t > t n 1,1 α Pr(t > t(obs) H 0 ) µ < µ 0 t < t n 1,α Pr(t < t(obs) H 0 ) Yujin Chung Lec6: Statistical hypothesis testings Fall /30
14 Two-sided alternatives The test for H0 : µ = µ 0 vs. H1 : µ µ 0 is based on t = x µ 0 s/ n. Critical-value method Rejection region: If t > t n 1,1 α/2, then H0 is rejected. Acceptance region: If t > tn 1,1 α/2, then H0 is NOT rejected. Type-I error: Pr( t > tn 1,1 α/2 H 0 ) = α. p-value method: p-value is Pr( t > t(obs) H 0 ). If p < 0.05, then H 0 is rejected at significance level 5%. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
15 Example: two-sided alternatives (continued from the cholesterol data) Is the average cholesterol level of children whose fathers had heart disease different from the US average cholesterol level (175) of children? The hypotheses are H 0 : µ = 175 vs. H 1 : µ 175. The observed test statistic is t(obs) = Critical-value method: At significance level of 5%, the rejection region is t > or t < Since the observed test statistic is < 1.58 < 2.262, we cannot reject H 0 at significance level 5%. p-value method: p = Pr( t > t(obs) H 0 ) = Pr(t > 1.58 H 0 ) + Pr(t < 1.58 H 0 ) = Since p > 0.05, we cannot reject H 0 at significance level 5%. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
16 The Relationship Between Hypothesis Testing and Confidence Intervals Suppose we are testing H0 : µ = µ 0 vs. H1 : µ µ 0. H 0 is rejected at significance level α, if and only if 100% (1 α) CI for µ does not contain µ 0. Recall that 100%(1 α) CI for µ is x ± t n 1,1 α/2 s/ n. If H 0 is rejected, t = x µ 0 s/ n < t n 1,1 α/2 or t > t n 1,1 α/2 x µ 0 < t n 1,1 α/2 s/ n or x µ 0 > t n 1,1 α/2 s/ n µ 0 > x + t n 1,1 α/2 s/ n or µ 0 < x t n 1,1 α/2 s/ n Therefore, 100%(1 α) CI does not include µ 0. Similarly, the inverse can be proved. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
17 z-test for the Mean with Known Variance Let X 1,..., X n N(µ, σ 2 ) and σ 2 is known. The test for H 0 : µ = µ 0 is based on the test statistic Z = X µ 0 σ/ n which follows N(0, 1) under H 0. H 1 Rejection region p-value µ > µ 0 z > z 1 α Pr(Z > z(obs) H 0 ) µ < µ 0 z < z α Pr(Z < z(obs) H 0 ) µ µ 0 z > z 1 α/2 Pr( Z > z(obs) H 0 ) Yujin Chung Lec6: Statistical hypothesis testings Fall /30
18 Tests for Binomial probability p Let X Binomial(n, p). We want to test H 0 : p = p 0 vs. H 1 : p p 0. ˆp p 0 The test statistic is Z =, where ˆp = X/n. If p0 (1 p 0 )/n np 0 (1 p 0 ) 5, Z N(0, 1) under H 0. Rejection region: z(obs) < z α/2 or z(obs) > z 1 α/2 p-value: If ˆp p0, then p = 2 Pr(Z < z(obs) H 0 ) If ˆp > p0, then p = 2 Pr(Z > z(obs) H 0 ) Yujin Chung Lec6: Statistical hypothesis testings Fall /30
19 Two-sample case In a two-sample hypothesis-testing problem, the underlying parameters of two different populations are compared. fwt left and right maxfwt in the control group and exposed group Independent samples: when the data points in one sample are unrelated to the data points in the second sample. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
20 The paired sample: Paired t-test Paired sample: when each data point in the first sample is matched and is related to a unique data point in the second sample. Paired samples may represent two sets of measurements on the same people or on different people who are chosen on an individual basis using matching criteria, such as age and sex, to be very similar to each other. LEAD data: The numbers of right-hand and left-hand finger-wrist tapping (fwt r and fwt l), respectively, were observed from each of 124 children. We want to test whether the number of finger-wrist tapping is different between right hand and left hand. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
21 The paired sample: paired t-test Consider two samples: (X 1,1, X 2,1 ),..., (X 1,n, X 2,n ), where E(X 1,i ) = µ 1 and E(X 2,i ) = µ 2 for all i = 1,..., n. We want to test H 0 : µ 1 = µ 2 vs. H 1 : µ 1 µ 2. Let = µ 1 µ 2. Then, H 0 : = 0 vs. H 1 : 0 To get rid of the correlation X 1,i and X 2,i, we consider the differences d i = X 1,i X 2,i for i = 1,..., n. We assume d 1,..., d n N(, σd 2 ). It is a one-sample t-test problem. Test statistic: t = d s d / n, where d and s d are the sample mean and standard deviation of the differences, respectively. Under H 0 : t t n 1 p-value= 2 Pr(t > t(obs) H 0 ) CI: d ± t n 1,1 α/2 s d / n Yujin Chung Lec6: Statistical hypothesis testings Fall /30
22 Example: Lead data The numbers of right-hand and left-hand finger-wrist tapping (fwt r and fwt l), respectively, were observed from each of 124 children. We want to test whether the number of finger-wrist tapping is different between right hand and left hand. Since fwt r and fwt l are not independent and paired sample, we consider the difference of them. H 0 : = 0 vs. H 1 : 0 mean of difference: d = and s.d. sd = test statistic: t = / 124 = The distribution of test statistic: t p-value: 2 Pr(t > ) = At significance level 5%, we reject the null. Right- and left- hand fwt are significantly different. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
23 Two independent samples: equal variances Consider two independent samples: X 1,1,..., X 1,n1 N(µ 1, σ 2 1 ) (sample size n 1 ) and X 2,1,..., X 2,n2 N(µ 2, σ 2 2 ) (sample size n 2). We want to test H 0 : µ 1 = µ 2 vs. H 1 : µ 1 µ 2. We assume σ 2 = σ 2 1 = σ2 2. X 1 N(µ 1, σ 2 /n 1 ), X2 N(µ 2, σ 2 /n 2 ) X 1 X 2 N(µ 1 µ 2, σ 2 /n 1 + σ 2 /n 2 ) the pooled variance estimation of σ 2 : s 2 = (n 1 1)s (n 2 1)s 2 2, weighted average of s 2 1 n 1 + n 2 2 and s2 2 X 1 Test statistic: t = X 2 s t n1 +n 2 2 under H 0. 1/n 1 + 1/n 2 Rejection region: t(obs) > t n1 +n 2 2,1 α/2 or t(obs) < t n1 +n 2 2,1 α/2 p-value= 2 Pr(t > t(obs) H 0 ) CI: ( x 1 x 2 ) ± t n1 +n 2 2,1 α/2s 2 1/n 1 + 1/n 2 Yujin Chung Lec6: Statistical hypothesis testings Fall /30
24 Example: Lead data We now assume the variances of maxfwt in the control (n 1 = 78) and exposed group (n 2 = 46) are the same. We want to test for H 0 : µ 1 = µ 2 vs. H 1 : µ 1 µ 2. sample means x 1 = 62.44, x 2 = 59.76; sample variances: s 2 1 = and s2 2 = pooled variance: s 2 (78 1) (46 1) = = test statistic: t = = (1/78 + 1/46) Under H 0, t t 122 (df= =122) p-value: 2 Pr(t > ) = At significance level 5%, we cannot reject the null hypothesis. No evidence of different means. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
25 Two independent samples: different variances Two samples: X 1,1,..., X 1,n1 N(µ 1, σ 2 1 ) (sample size n 1) and X 2,1,..., X 2,n2 N(µ 2, σ 2 2 ) (sample size n 2). We want to test H 0 : µ 1 = µ 2 vs. H 1 : µ 1 µ 2. X 1 X 2 N(µ 1 µ 2, σ1 2/n 1 + σ1 2/n 2) X 1 Test statistic: t = X 2 s 2 1 /n 1 + s 2 2 /n 2 Under H 0 : the test statistic approximately follows t-distribution with d.f. d (s 2 1 = /n 1 + s 2 2 /n 2) 2 (s 2 1 /n 1) 2 /(n 1 1) + (s 2 2 /n 2) 2 /(n 2 1) Rejection region: t(obs) > t d,1 α/2 or t(obs) < t d,1 α/2 p-value= 2 Pr(t > t(obs) H 0 ) CI: ( x 1 x 2 ) ± t d,1 α/2 s 2 1 /n 1 + s 2 2 /n 2 Yujin Chung Lec6: Statistical hypothesis testings Fall /30
26 F-test for the Equal Variances Two samples: X 1,1,..., X 1,n1 N(µ 1, σ 2 1 ) (sample size n 1) and X 2,1,..., X 2,n2 N(µ 2, σ 2 2 ) (sample size n 2). We want to test H 0 : σ 2 1 = σ2 2 vs. H 1 : σ 2 1 σ2 2. In other words, H 0 : σ 2 1 /σ2 2 = 1 vs. H 1 : σ 2 1 /σ2 2 1 (n 1 1)S 2 1 /σ2 1 χ2 n 1 1 and (n 2 2)S 2 2 /σ2 2 χ2 n 2 1 Test statistic: F = S2 1 S2 2 F n1 1,n 2 1 under H 0 Rejection region: F (obs) > F n1 1,n 2 1,1 α/2 or F (obs) < F n2 1,n 1 1,α/2 p-value If F (obs) 1, then p = 2 Pr(F > F (obs) H0 ) If F (obs) < 1, then p = 2 Pr(F < F (obs) H0 ) Yujin Chung Lec6: Statistical hypothesis testings Fall /30
27 Example: Lead data Test whether the variances of maxfwt in the control (n 1 = 78) and exposed group (n 2 = 46) are the same or not. H 0 : σ 2 1 = σ2 2 vs. H 0 : σ 2 1 σ2 2 sample variances: s 2 1 = and s2 2 = test statistic: F = s 2 1 /s2 2 = / = The distribution of test statistic under H 0 : F 77,45 p-value: 2 Pr(F < ) = At significance level 5%, we cannot reject the null hypothesis. There is no evidence that the variances are different. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
28 Overlapping Confidence Intervals and Statistical Significance Can we judge whether two statistics are significantly different depending on whether or not their confidence intervals overlap? The answer is: not always. If two statistics have non-overlapping confidence intervals, they are significantly different. If they have overlapping confidence intervals, it is not necessarily true that they are not significantly different. Yujin Chung Lec6: Statistical hypothesis testings Fall /30
29 Overlapping Confidence Intervals and Statistical Significance Assume x 1 x 2 0 without loss of generality. The means are significantly different if p (x 1 x 2 ) > 1.96 SE12 + SE22 CIs do not overlap if x SE1 > x SE2 which implies (x1 x2 ) > 1.96(SE1 + SE2 ) p Since SE12 + SE22 SE1 + SE2, Yujin Chung Lec6: Statistical hypothesis testings Fall /30
30 Summary 1 What are your hypotheses? 2 Identify data type and test statistic t-test, z-test, χ 2 -test, F-test one-sample or two-sample (paired or independent) 3 Perform a test 4 Go back to numerical and/or graphical summary and confirm your test result matches your data Yujin Chung Lec6: Statistical hypothesis testings Fall /30
AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationCOMPARING GROUPS PART 1CONTINUOUS DATA
COMPARING GROUPS PART 1CONTINUOUS DATA Min Chen, Ph.D. Assistant Professor Quantitative Biomedical Research Center Department of Clinical Sciences Bioinformatics Shared Resource Simmons Comprehensive Cancer
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationTwo sample hypothesis testing
Statistics February 26, 2014 Debdeep Pati Two sample hypothesis testing 1. Suppose we want to study the relationship between use of oral contraceptives (OC) and level of blood pressure (BP) in women. 2.
More informationBusiness Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing
Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology
More informationOne-sample categorical data: approximate inference
One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution
More informationFirst we look at some terms to be used in this section.
8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More information1 Statistical inference for a population mean
1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationEpidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval
Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in
More informationPubH 5450 Biostatistics I Prof. Carlin. Lecture 13
PubH 5450 Biostatistics I Prof. Carlin Lecture 13 Outline Outline Sample Size Counts, Rates and Proportions Part I Sample Size Type I Error and Power Type I error rate: probability of rejecting the null
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationNormal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):
Lecture Three Normal theory null distributions Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT): A random variable which is a sum of many
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What
More informationEC2001 Econometrics 1 Dr. Jose Olmo Room D309
EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:
More information7 Estimation. 7.1 Population and Sample (P.91-92)
7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for
More informationCHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:
CHAPTER 8 Test of Hypotheses Based on a Single Sample Hypothesis testing is the method that decide which of two contradictory claims about the parameter is correct. Here the parameters of interest are
More informationAn inferential procedure to use sample data to understand a population Procedures
Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors
More informationHypothesis Testing. ECE 3530 Spring Antonio Paiva
Hypothesis Testing ECE 3530 Spring 2010 Antonio Paiva What is hypothesis testing? A statistical hypothesis is an assertion or conjecture concerning one or more populations. To prove that a hypothesis is
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationDr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38
BIO5312 Biostatistics Lecture 11: Multisample Hypothesis Testing II Dr. Junchao Xia Center of Biophysics and Computational Biology Fall 2016 11/8/2016 1/38 Outline In this lecture, we will continue to
More informationSTAT 4385 Topic 01: Introduction & Review
STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics
More informationStatistics for IT Managers
Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample
More informationLECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING
LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation
More informationIntroduction to Statistical Data Analysis III
Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The
More informationCH.9 Tests of Hypotheses for a Single Sample
CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance
More informationCHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:
CHAPTER 9, 10 Hypothesis Testing Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities: The person is guilty. The person is innocent. To
More informationSTAT 135 Lab 5 Bootstrapping and Hypothesis Testing
STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,
More informationPower of a test. Hypothesis testing
Hypothesis testing February 11, 2014 Debdeep Pati Power of a test 1. Assuming standard deviation is known. Calculate power based on one-sample z test. A new drug is proposed for people with high intraocular
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationTA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM
STAT 301, Fall 2011 Name Lec 4: Ismor Fischer Discussion Section: Please circle one! TA: Sheng Zhgang... 341 (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan... 345 (W 1:20) / 346 (Th
More informationEconomics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing
Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing Eric Zivot October 12, 2011 Hypothesis Testing 1. Specify hypothesis to be tested H 0 : null hypothesis versus. H 1 : alternative
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationChapter 9 Inferences from Two Samples
Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review
More informationSTAT 515 fa 2016 Lec Statistical inference - hypothesis testing
STAT 515 fa 2016 Lec 20-21 Statistical inference - hypothesis testing Karl B. Gregory Wednesday, Oct 12th Contents 1 Statistical inference 1 1.1 Forms of the null and alternate hypothesis for µ and p....................
More informationChapter 5: HYPOTHESIS TESTING
MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate
More informationThe t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18
and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of
More informationSampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =
2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationChapter 9. Hypothesis testing. 9.1 Introduction
Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationBINF 702 SPRING Chapter 8 Hypothesis Testing: Two-Sample Inference. BINF702 SPRING 2014 Chapter 8 Hypothesis Testing: Two- Sample Inference 1
BINF 702 SPRING 2014 Chapter 8 Hypothesis Testing: Two-Sample Inference Two- Sample Inference 1 A Poster Child for two-sample hypothesis testing Ex 8.1 Obstetrics In the birthweight data in Example 7.2,
More informationHypothesis testing: theory and methods
Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable
More information1; (f) H 0 : = 55 db, H 1 : < 55.
Reference: Chapter 8 of J. L. Devore s 8 th Edition By S. Maghsoodloo TESTING a STATISTICAL HYPOTHESIS A statistical hypothesis is an assumption about the frequency function(s) (i.e., pmf or pdf) of one
More informationBIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation
BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)
More informationExam 2 (KEY) July 20, 2009
STAT 2300 Business Statistics/Summer 2009, Section 002 Exam 2 (KEY) July 20, 2009 Name: USU A#: Score: /225 Directions: This exam consists of six (6) questions, assessing material learned within Modules
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationPower and Sample Size Bios 662
Power and Sample Size Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-31 14:06 BIOS 662 1 Power and Sample Size Outline Introduction One sample: continuous
More informationTests about a population mean
October 2 nd, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence
More informationSmoking Habits. Moderate Smokers Heavy Smokers Total. Hypertension No Hypertension Total
Math 3070. Treibergs Final Exam Name: December 7, 00. In an experiment to see how hypertension is related to smoking habits, the following data was taken on individuals. Test the hypothesis that the proportions
More informationProbability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!
Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive
More informationUnit 9 ONE Sample Inference SOLUTIONS
BIOSTATS 540 Fall 017 Introductory Biostatistics Solutions Unit 9 ONE Sample Inference SOLUTIONS 1. This exercise gives you practice calculating a confidence interval for the mean of a Normal distribution
More informationStatistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts
Statistical methods for comparing multiple groups Lecture 7: ANOVA Sandy Eckel seckel@jhsph.edu 30 April 2008 Continuous data: comparing multiple means Analysis of variance Binary data: comparing multiple
More informationFrequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=
A frequency distribution is a kind of probability distribution. It gives the frequency or relative frequency at which given values have been observed among the data collected. For example, for age, Frequency
More informationLECTURE 5 HYPOTHESIS TESTING
October 25, 2016 LECTURE 5 HYPOTHESIS TESTING Basic concepts In this lecture we continue to discuss the normal classical linear regression defined by Assumptions A1-A5. Let θ Θ R d be a parameter of interest.
More informationThe Components of a Statistical Hypothesis Testing Problem
Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one
More informationUnit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users
BIOSTATS 640 Spring 2017 Review of Introductory Biostatistics STATA solutions Page 1 of 16 Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users #1. The following table lists length of
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationInferences About Two Population Proportions
Inferences About Two Population Proportions MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Background Recall: for a single population the sampling proportion
More informationMedical statistics part I, autumn 2010: One sample test of hypothesis
Medical statistics part I, autumn 2010: One sample test of hypothesis Eirik Skogvoll Consultant/ Professor Faculty of Medicine Dept. of Anaesthesiology and Emergency Medicine 1 What is a hypothesis test?
More informationBINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference
BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2014 Chapter 7 Hypothesis Testing 1 Section 7.9 One-Sample c 2 Test for the Variance of a Normal Distribution Eq. 7.40
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationTopic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing. Road Map Sampling Distributions, Confidence Intervals & Hypothesis Testing
Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing ECO22Y5Y: Quantitative Methods in Economics Dr. Nick Zammit University of Toronto Department of Economics Room KN3272 n.zammit
More informationTables Table A Table B Table C Table D Table E 675
BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationIntroduction to Statistics
MTH4106 Introduction to Statistics Notes 15 Spring 2013 Testing hypotheses about the mean Earlier, we saw how to test hypotheses about a proportion, using properties of the Binomial distribution It is
More informationLab #12: Exam 3 Review Key
Psychological Statistics Practice Lab#1 Dr. M. Plonsky Page 1 of 7 Lab #1: Exam 3 Review Key 1) a. Probability - Refers to the likelihood that an event will occur. Ranges from 0 to 1. b. Sampling Distribution
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationLecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1
Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 9.1-1 Chapter 9 Inferences
More informationLECTURE 5. Introduction to Econometrics. Hypothesis testing
LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will
More informationMathematical Statistics
Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics
More informationTwo Sample Problems. Two sample problems
Two Sample Problems Two sample problems The goal of inference is to compare the responses in two groups. Each group is a sample from a different population. The responses in each group are independent
More informationIntroductory Econometrics. Review of statistics (Part II: Inference)
Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationChapter 10: Inferences based on two samples
November 16 th, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence
More informationLab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d
BIOS 4120: Introduction to Biostatistics Breheny Lab #11 We will explore observational studies in today s lab and review how to make inferences on contingency tables. We will only use 2x2 tables for today
More informationNature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.
Understanding regression output from software Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals In 1966 Cyril Burt published a paper called The genetic determination of differences
More informationContents. 22S39: Class Notes / October 25, 2000 back to start 1
Contents Determining sample size Testing about the population proportion Comparing population proportions Comparing population means based on two independent samples Comparing population means based on
More informationLast two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals
Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last two weeks: Sample, population and sampling
More informationWelcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)
. Welcome! Webinar Biostatistics: sample size & power Thursday, April 26, 12:30 1:30 pm (NDT) Get started now: Please check if your speakers are working and mute your audio. Please use the chat box to
More informationVisual interpretation with normal approximation
Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation
More informationLecture 5: ANOVA and Correlation
Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions
More informationStatistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong
Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data
More informationSTAT Chapter 8: Hypothesis Tests
STAT 515 -- Chapter 8: Hypothesis Tests CIs are possibly the most useful forms of inference because they give a range of reasonable values for a parameter. But sometimes we want to know whether one particular
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationQuantitative Methods for Economics, Finance and Management (A86050 F86050)
Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More information10.4 Hypothesis Testing: Two Independent Samples Proportion
10.4 Hypothesis Testing: Two Independent Samples Proportion Example 3: Smoking cigarettes has been known to cause cancer and other ailments. One politician believes that a higher tax should be imposed
More informationLecture 3. Biostatistics in Veterinary Science. Feb 2, Jung-Jin Lee Drexel University. Biostatistics in Veterinary Science Lecture 3
Lecture 3 Biostatistics in Veterinary Science Jung-Jin Lee Drexel University Feb 2, 2015 Review Let S be the sample space and A, B be events. Then 1 P (S) = 1, P ( ) = 0. 2 If A B, then P (A) P (B). In
More informationMTMS Mathematical Statistics
MTMS.01.099 Mathematical Statistics Lecture 12. Hypothesis testing. Power function. Approximation of Normal distribution and application to Binomial distribution Tõnu Kollo Fall 2016 Hypothesis Testing
More information1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests
Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores
More informationBIOS 312: Precision of Statistical Inference
and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample
More informationCHAPTER 10 HYPOTHESIS TESTING WITH TWO SAMPLES
CHAPTER 10 HYPOTHESIS TESTING WITH TWO SAMPLES In this chapter our hypothesis tests allow us to compare the means (or proportions) of two different populations using a sample from each population For example,
More informationProblem Set 4 - Solutions
Problem Set 4 - Solutions Econ-310, Spring 004 8. a. If we wish to test the research hypothesis that the mean GHQ score for all unemployed men exceeds 10, we test: H 0 : µ 10 H a : µ > 10 This is a one-tailed
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests 1999 Prentice-Hall, Inc. Chap. 8-1 Chapter Topics Hypothesis Testing Methodology Z Test
More information