Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

Similar documents
Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Confidence Intervals. - simply, an interval for which we have a certain confidence.

Categorical Data Analysis. The data are often just counts of how many things each category has.

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Dealing with the assumption of independence between samples - introducing the paired design.

Descriptive Statistics (And a little bit on rounding and significant digits)

CHAPTER 9: HYPOTHESIS TESTING

Comparison of Two Population Means

Chapter 7 Class Notes Comparison of Two Independent Samples

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

Regression, Part I. - In correlation, it would be irrelevant if we changed the axes on our graph.

Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2:

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Uni- and Bivariate Power

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Hypothesis Testing: Chi-Square Test 1

Hypothesis testing. Data to decisions

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Confidence intervals

Two sample Hypothesis tests in R.

Chapter 26: Comparing Counts (Chi Square)

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Lab #12: Exam 3 Review Key

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

CBA4 is live in practice mode this week exam mode from Saturday!

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Hypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis.

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras. Lecture 11 t- Tests

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

We need to define some concepts that are used in experiments.

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

- a value calculated or derived from the data.

Statistics for IT Managers

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Chapter 20 Comparing Groups

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

STAT Chapter 8: Hypothesis Tests

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size.

Quantitative Understanding in Biology 1.7 Bayesian Methods

Violating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test?

Two Sample Problems. Two sample problems

Quadratic Equations Part I

Section 4.6 Negative Exponents

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

DISTRIBUTIONS USED IN STATISTICAL WORK

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Chapter 9: Inferences from Two Samples. Section Title Pages

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

Contrasts and Multiple Comparisons Supplement for Pages

Module 5 Practice problem and Homework answers

Regression and the 2-Sample t

The Difference in Proportions Test

Multiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota

REVIEW 8/2/2017 陈芳华东师大英语系

Binary Logistic Regression

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

Ch18 links / ch18 pdf links Ch18 image t-dist table

Multiple Regression Analysis

PHY 101L - Experiments in Mechanics

Chapters 4-6: Inference with two samples Read sections 4.2.5, 5.2, 5.3, 6.2

Problem of the Day. 7.3 Hypothesis Testing for Mean (Small Samples n<30) Objective(s): Find critical values in a t-distribution

STAT 350 Final (new Material) Review Problems Key Spring 2016

10.1. Comparing Two Proportions. Section 10.1

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th

Inference for Distributions Inference for the Mean of a Population

Lecture 4: Testing Stuff

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Statistical Inference with Regression Analysis

Rejection regions for the bivariate case

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

COMPARING GROUPS PART 1CONTINUOUS DATA

Difference between means - t-test /25

Sampling Distributions: Central Limit Theorem

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Elementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Business Statistics. Lecture 5: Confidence Intervals

Testing Research and Statistical Hypotheses

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Topic 1. Definitions

( ) P A B : Probability of A given B. Probability that A happens

, p 1 < p 2 < < p l primes.

Test of Hypothesis for Small and Large Samples and Test of Goodness of Fit

Mathematical Induction

Psych 230. Psychological Measurement and Statistics

9 Correlation and Regression

Transcription:

Two sided, two sample t-tests. I. Brief review: 1) We are interested in how a sample compares to some pre-conceived notion. For example: a) IQ = 100 b) Average height for men = 5 10. c) Average number of white blood cells per cubic millimeter is 7,000. ) We frame our question in terms of a hypothesis: a) H 0 : mean(iq) = 100 b) H 0 : mean (height for men) = 5 10. c) H 0 : mean (# of white blood cells) = 7,000. (in all cases, our hypothesis is about the POPULATION mean). 3) We develop an alternative hypothesis: a) H 1 : mean (IQ) 100 etc. ) We decide on what level of α = Pr{type I error} we are willing to live with (e.g., α =. 05). 5) We calculate a t value from our data: t * = y s n 6) We look up a t value from our t-tables or from the computer: t, where = degrees of freedom 7) We compare our t ( = t * or t s ) to the tabulated t: if t * we reject H 0 if t * we fail to reject H 0 8) We then proceed with our study accordingly.

II. The two-sampled t-test. 1) Unfortunately, you ll discover that the one-sampled t-test is not terribly useful in biology. Usually we don t have a pre-conceived notion of what μ might be (though quite often we might be interested in putting a confidence interval around some result). ) But what we often do: a) Compare two groups - say males and females. b) See if these two groups have different responses, or differ significantly about something. c) For example: i) Group A gets a new medicine. Group B gets a placebo. Does the new medicine work?? ii) Are men the same average height as women? iii) Do lions and tigers have the same average litter size? (Yes, this isn t normal, but we ll worry about that later). d) This kind of situation develops quite often in biology. ) So how do we proceed? Well, let s start by developing hypotheses for the above examples: a) H 0 : mean (group A) = mean (group B) (we didn t say what we re measuring here) b) H 0 : mean (height for men) = mean (height for women) c) H 0 : mean (litter size for lions) = mean (litter size for tigers) 3) How is this different from our one-sampled test? a) we re comparing two means. In the one sample test we compared one mean with a preconceived idea b) note that again we re interested in the population means, but we re using our samples to get at the information we want. ) So now let s set up our alternative hypotheses: a) H 1 : mean (group A) mean (group B) Comment: yes, we could do: H 1 : mean (group A) < mean(group B)

We will look at one sided tests soon. 5) Then we decide on α. 6) We calculate our t-star. But what is our t * now??? - we have two estimates for the standard error! One from our first sample, one from our second sample. - we also have two sample means floating around. - Question: are we willing to believe that the standard errors in our samples are really the same?? - Let s assume for the moment that we don t think the two standard errors are the same. - Like with the one-sample test, we put a difference in the numerator, and our standard error in the denominator. We just need to figure out our difference, and then our standard error. - Instead of ( y - μ), we now use ( y 1 y ) (technically, it s y 1 1 y, but since μ 1 = μ (our H 0 says μ 1 = μ ), the μ's cancel out. - Instead of our single standard error, we now find some way of averaging our two standard errors and put this in the denominator. - Here s the formula: y t * = 1 y s 1 s n 1 n - That, hopefully makes sense. You can see that the denominator is a kind of average between our two estimates for the standard error. 7) Now we look up our tabulated t-value. a) Problem! How many degrees of freedom do we have?? (what is ν?) b) As it turns out, it s not an easy answer. It does turn out, though that: min(n 1-1, n -1) < df < (n 1 + n - )

which unfortunately isn t very useful (if you need a quick and dirty estimate, go with the min(n 1-1, n -)) - that will give you a conservative estimate. c) What is it really?? = SE 1 SE SE 1 n 1 1 SE n 1 (OUCH!) d) Most statistical software will automatically do this for you, and so will some calculators [but you want to verify your calculator actually uses this before you start spitting out answers!]. e) so now we can look up our tabulated t with the appropriate degrees of freedom and our pre-determined value of α. 8) The last step is then identical to what we did before: if t* we reject H 0 if t * we fail to reject H 0 9) Note: we ll take a look at how things work if we can assume equal variances next time. 10) So here s an example, exercise 7. on p. 9 [7.9, p. ] {7..7, p. 3}. a) (a kind of morbid example). Amines have been suspected of being involved in spasms of the coronary arteries. So some researchers measured the levels of seratonin (an amine) in patients who died of heart disease and then compared these levels to those in patients who did not die of heart disease (died due to other causes). Let s look at the results: Heart disease No heart disease n 8 1 y bar 3,80 5,310 SE 850 60 In the text, they calculate a lot of the stuff for you, but let s see how they did it:

y t * = 1 y s 1 s n 1 n = 3,80 5,310 SE 1 SE 1,70 = 850 60 = 1,70 1,06 = 1.38 We can see where they got the Standard Error of ( y 1 y ), and also the calculated t-value (reported in the answer section in the back of the book). Now we need to figure out our tabulated t-value. For α the book says to use.05 (in each tail), and our degrees of freedom are: = df = SE 1 SE SE 1 n 1 1 SE n 1 = 850 60 850 7 60 11 1.817 10 1 = = 1.7 7.57 10 10 10 1.55 10 So we look up the tabulated t-value: t 1,.05 = 1.761 And since: t * t 1,.05 Note: we always round down our degrees of freedom. For example, 1.98 = 1 degrees of freedom. Also, we always pick the lower number for degrees of freedom from the t-table if our number (after rounding down) isn't in the table.) We conclude that we don t have evidence that seratonin levels are different in patients with heart disease compared to those with no heart disease. Incidentally, we should have explicitly stated: H 0 : mean seratonin level (heart disease) = mean seratonin level (no heart disease) H 1 : mean1 mean (you can fill in the exact wording)

Sometimes we get lazy and simply say H 0 : μ 1 = μ and H 1 : μ 1 μ, but on problems and exams you should always write out the exact H 0 and H 1 you're using. 11) Also look at examples 7.10 through 7.1 {7.. through 7..5}. Note: [7.1] {7..5} are different in the nd vs. 3 rd or th editions, but they cover the same material. If you're not sure what's going on, go through these examples carefully!