One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

Similar documents
Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Confidence intervals

Violating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test?

Dealing with the assumption of independence between samples - introducing the paired design.

Descriptive Statistics (And a little bit on rounding and significant digits)

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

CHAPTER 9: HYPOTHESIS TESTING

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!

Chapter 27 Summary Inferences for Regression

Categorical Data Analysis. The data are often just counts of how many things each category has.

Confidence Intervals. - simply, an interval for which we have a certain confidence.

appstats27.notebook April 06, 2017

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?

Regression, Part I. - In correlation, it would be irrelevant if we changed the axes on our graph.

Math101, Sections 2 and 3, Spring 2008 Review Sheet for Exam #2:

5.2 Infinite Series Brian E. Veitch

Chapter 1 Review of Equations and Inequalities

Elementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing

Correlation and regression

( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of

Quadratic Equations Part I

Regression I - the least squares line

Calculus II. Calculus II tends to be a very difficult course for many students. There are many reasons for this.

Warm-up Using the given data Create a scatterplot Find the regression line

Stat 20 Midterm 1 Review

Final Review Sheet. B = (1, 1 + 3x, 1 + x 2 ) then 2 + 3x + 6x 2

Statistics: revision

a. See the textbook for examples of proving logical equivalence using truth tables. b. There is a real number x for which f (x) < 0. (x 1) 2 > 0.

STA Module 10 Comparing Two Proportions

Chapter 26: Comparing Counts (Chi Square)

2. l = 7 ft w = 4 ft h = 2.8 ft V = Find the Area of a trapezoid when the bases and height are given. Formula is A = B = 21 b = 11 h = 3 A=

Algebra Exam. Solutions and Grading Guide

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

Expansion of Terms. f (x) = x 2 6x + 9 = (x 3) 2 = 0. x 3 = 0

CS 301. Lecture 18 Decidable languages. Stephen Checkoway. April 2, 2018

PHY 101L - Experiments in Mechanics

Vectors. A vector is usually denoted in bold, like vector a, or sometimes it is denoted a, or many other deviations exist in various text books.

Algebra & Trig Review

Multiple Regression Analysis

εx 2 + x 1 = 0. (2) Suppose we try a regular perturbation expansion on it. Setting ε = 0 gives x 1 = 0,

Algebra Year 9. Language

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.

Chapter 1. Foundations of GMAT Math. Arithmetic

CS 124 Math Review Section January 29, 2018

[Disclaimer: This is not a complete list of everything you need to know, just some of the topics that gave people difficulty.]

AP Calculus AB Summer Assignment

CHAPTER 7: TECHNIQUES OF INTEGRATION

Line Integrals and Path Independence

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Chapter 5: HYPOTHESIS TESTING

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Introduction. So, why did I even bother to write this?

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

MITOCW ocw-18_02-f07-lec02_220k

Testing Research and Statistical Hypotheses

Bell s spaceship paradox

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size.

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Where Is Newton Taking Us? And How Fast?

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

Math 144 Summer 2012 (UCR) Pro-Notes June 24, / 15

Lecture 11: Extrema. Nathan Pflueger. 2 October 2013

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Systems of Linear Equations and Inequalities

#29: Logarithm review May 16, 2009

Physic 602 Conservation of Momentum. (Read objectives on screen.)

Section 4.6 Negative Exponents

Basics of Proofs. 1 The Basics. 2 Proof Strategies. 2.1 Understand What s Going On

!t + U " #Q. Solving Physical Problems: Pitfalls, Guidelines, and Advice

Math 31 Lesson Plan. Day 2: Sets; Binary Operations. Elizabeth Gillaspy. September 23, 2011

LECTURE 15: SIMPLE LINEAR REGRESSION I

Systematic Uncertainty Max Bean John Jay College of Criminal Justice, Physics Program

Algebra Year 10. Language

Math 5a Reading Assignments for Sections

Linear Classifiers and the Perceptron

Modern Algebra Prof. Manindra Agrawal Department of Computer Science and Engineering Indian Institute of Technology, Kanpur

Uncertainty. Michael Peters December 27, 2013

Section 20: Arrow Diagrams on the Integers

Physics 6A Lab Experiment 6

Science. Science. Passage I

AP Calculus AB Summer Assignment

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Sec. 1 Simplifying Rational Expressions: +

Algebra 8.6 Simple Equations

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown

Remember that all physical quantities are measured in units. The unit of force is called the newton (N), where 1 N = (1 kg)(1 m/s 2 ).

ASTRO 114 Lecture Okay. We re now gonna continue discussing and conclude discussing the entire

Lecture 4: Constructing the Integers, Rationals and Reals

Rainfall, Part IV. Rain does not normally accumulate above this level

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Solution to Proof Questions from September 1st

Section 2.7 Solving Linear Inequalities

MATH 22 FUNCTIONS: ORDER OF GROWTH. Lecture O: 10/21/2003. The old order changeth, yielding place to new. Tennyson, Idylls of the King

Transcription:

One sided tests So far all of our tests have been two sided. While this may be a bit easier to understand, this is often not the best way to do a hypothesis test. One simple thing that we can do to get a more powerful test is to do a one sided test. One sided tests are also called one tailed or directional. An example of a two sided alternative is what we ve been using for our two sample tests: H 1 : µ 1 µ 2 Notice that this uses the symbol. We reject if we think µ 1 is bigger than µ 2 or if we think µ 1 is smaller than µ 2. A one sided test uses only one of these alternatives, and our alternative hypothesis becomes: H 1 : µ 1 > µ 2 or H 1 : µ 1 < µ 2 (one or the other, not both!) How do we know when to do a one sided test? There are two ways that we can decide to do a one sided test: 1. We know something about the subject matter. Suppose we try to figure out if there is a difference in height between men and women. If µ 1 = the true average height of women and µ 2 = the true average height of men, what outcome do we expect? We expect men to be taller. So we can choose our alternative hypothesis as follows: H 1 : µ 1 < µ 2 2. Logic tells us something about the problem at hand. Suppose we test a new medication to reduce blood pressure. If If µ 1 = the true average systolic blood pressure without medication and µ 2 = the true average systolic blood pressure with medication, what do we expect to happen? We expect blood pressure to drop after taking medication, so we would choose our alternative hypothesis as follows: H 1 : µ 1 > µ 2 We use subject matter knowledge or logic to decide if we should do a one sided test. Note that we never (never!) decide to use a one sided test by comparing ȳ 1 with ȳ 2 : This is WRONG: if ȳ 1 < ȳ 2 then do H 1 : µ 1 < µ 2 The reason we don t do this is that a one sided test gives you more power. You are only entitled to this extra power if you can decide ahead of time to do a one sided test. We ll look more closely at power below. Now that we know a little about how to decide whether or not to do a one sided test, we need to figure out how to actually carry out a one sided test. The good news is that almost all of our math that we ve learned so far stays the same. A one sided two sample T test, for 1

One sided tests 2 example, uses the same calculations as the two sided test. However, there are two things we need to watch out for: We need to make sure our data agree with the alternative hypothesis. We need to pull out the correct value from the tables. Let s use a two sample T test as an example and outline this (the below indicates the parts you need to do differently): 1. Set up your hypotheses: H 1 : µ 1 > µ 2 (or H 1 : µ 1 < µ 2 ) 2. Pick your value for α 3. Verify that your data agree with your alternative hypothesis: For example, if your alternative hypothesis is H 1 : µ 1 < µ 2, you need to make sure that ȳ 1 < ȳ 2 (or vice versa). If this is not true, STOP: Don t do anything else. Don t continue your calculations. Your results contradict your alternative hypothesis. You automatically fail to reject. There might also be something screwy with your data. If you forget this step, you could make very bad mistakes! 4. Calculate t in the usual way. Remember, there are several different versions of the T test; just do everything the same as usual. Don t forget the assumptions of your test (normal distribution, etc.). 5. Make your comparison in the usual way: Make sure you use the one sided value for t table from the tables. If t t table reject H 0, otherwise fail to reject H 0. This is not the traditional way of doing one sided tests T tests. Usually you take note of the sign (+ or ) of t, and then compare that to the tabulated value, which may be negative (you may need to change the sign of the table value). Although this does get rid of the requirement of having to check if your data agree with H 1, it is also rather more confusing. Let s do an example. In general, the larger the animal, the slower the heart rate. For example, the average heart rate of an elephant (about 30) is much lower than that for

One sided tests 3 humans (about 70). You want to compare the heart rates between rats and mice. Based on what you just read, who should have the higher heart rate (what do we expect to happen)? If we let µ 1 =the true average hear rates of rats and µ 2 =the true average heart rate of mice, what do we expect to happen? This is what we put in our alternative hypothesis: H 1 : µ 1 < µ 2 this is what we expect to happen. So now let s proceed. We ll pick α = 0.10. Here are the data: ȳ s SE Heart rate in rats: 478 490 563 617 371 503.8 93.29 41.72 Heart rate in mice: 788 619 365 814 655 648.2 178.98 80.04 Let s make sure our data agree with our alternative hypothesis: We note that ȳ 1 = 503.8, and that ȳ 2 = 648.2. Our alternative hypothesis is that µ 1 < µ 2, and since ȳ 1 < ȳ 2 our data agree with the alternative so we continue. Let s calculate t : t = 508.8 648.2 93.29 2 + 178.982 5 5 = 1.5998 = t = 1.5998 And then figure out our degrees of freedom: d.f. = ν = (41.722 + 80.04 2 ) 2 41.72 4 + 80.044 4 4 = 6.042 So we use d.f. =6 (we round down). Now we go to our T tables and look up the one sided value: t table = t 0.10,6 = 1.440 And since t = 1.5998 t 0.10,6 = 1.440, we reject our null hypothesis and conclude that the heart rates in rats are slower than those in mice.

One sided tests 4 So we ve learned how to do a one sided or directional test. Why can t we just stick with two sided tests? Because one sided tests have more power. Let s do our example again, but this time as a two sided test: H 1 : µ 1 µ 2 We pick the same α, so α = 0.10. We don t have to make sure our data agree with anything, so we can proceed directly to calculating t. We don t actually have to calculate this since we already did the calculation for both t and d.f. above. So we look up our t table value, but since we re now doing a two sided test, we pick: t table = t 0.10,6 = 1.948 If we now do our comparison, we find that t = 1.5998 < t 0.10,6 = 1.948, and we fail to reject our null hypothesis. If we do a one sided test, we can see a difference in heart rates, if we do a two sided test, we can not find a difference. We lost power by doing a two sided test. Let s look at a picture to see what s going on: Two tailed test requires 5% in each tail (uses a critical t value of 1.948) One tailed test puts 10% in one tail (uses a critical t value of 1.440) 5% of area 90% of area 5% of area 90% of area 10% of area (where we fail to reject) (where we fail to reject) 1.948 1.948 4 0 4 1.440 4 0 4 t with 6 d.f. t with 6 d.f.

One sided tests 5 A one sided test lets us put all of the area on one side, and this lets us use a smaller value of t table, so it makes it easier to reject (and gives us more power). While it isn t true for all tests, many other tests can be done using a one sided alternative hypothesis. We ll learn some of these in the next few chapters. In the meantime, let s take a quick look at how to do a Mann-Whitney U test that is one sided: Develop your hypotheses (just as you would for the one-tailed T test). Decide on α. Calculate U. You still need to verify that your data agree with H 1. Since the MWU test doesn t always use means in it s hypotheses, what do you do? You use K 1 and K 2. For example, if H 1 : µ 1 < µ 2, then you make sure that K 1 < K 2. The same thing applies if you re doing H 1 : D 1 < D 2 ; make sure that K 1 < K 2. Now look up your U table value, but make sure you use the values for one sided probabilities. Remember that the MWU tables are laid out so that each table is for a single value of α, so make sure you use the one sided values. Your comparison is the same as always. if U U table reject the null hypothesis. Finally, some comments on one sided tests: If you can do a one sided test you should. It gives you more power, and as a statistician you should always use the best (i.e., most powerful) test available. But, remember, you should have a reason for choosing a one sided test before you carry out your test. Preferably you should actually design your experiment with a one sided test in mind if at all possible.