Correct: P > Correct: < P < Correct: P = 0.036

Similar documents
Chapter 7 Class Notes Comparison of Two Independent Samples

STAT 135 Lab 8 Hypothesis Testing Review, Mann-Whitney Test by Normal Approximation, and Wilcoxon Signed Rank Test.

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Statistical inference provides methods for drawing conclusions about a population from sample data.

Chapter 7 Comparison of two independent samples

Dealing with the assumption of independence between samples - introducing the paired design.

Distribution-Free Procedures (Devore Chapter Fifteen)

Inferences About the Difference Between Two Means

Ch18 links / ch18 pdf links Ch18 image t-dist table

Wilcoxon Test and Calculating Sample Sizes

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics

MATH Chapter 21 Notes Two Sample Problems

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Simple Linear Regression: One Qualitative IV

Do not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13

Introduction to Statistical Data Analysis III

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)

Visual interpretation with normal approximation

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

Non-parametric methods

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

Chapter 24. Comparing Means

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

STA 101 Final Review

SPSS Guide For MMI 409

Solutions exercises of Chapter 7

Nonparametric tests, Bootstrapping

Hypothesis testing: Steps

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Sociology 593 Exam 1 Answer Key February 17, 1995

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Module 9: Nonparametric Statistics Statistics (OA3102)

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

a. When a data set is not normally distributed, what should you try in order to appropriately make statistical tests on that data?

Class 19. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Solution: The graph is certainly not a line, since the variables are squared. Let s plot points and see what we get.

Statistics: revision

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Sampling Distributions: Central Limit Theorem

psychological statistics

Resampling Methods. Lukas Meier

Hypothesis testing. Data to decisions

Hypothesis testing: Steps

Section 1.2: Conditional Statements

20 Hypothesis Testing, Part I

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

Mitosis Data Analysis: Testing Statistical Hypotheses By Dana Krempels, Ph.D. and Steven Green, Ph.D.

Lecture 7: Hypothesis Testing and ANOVA

3. Nonparametric methods

BINF 702 SPRING Chapter 8 Hypothesis Testing: Two-Sample Inference. BINF702 SPRING 2014 Chapter 8 Hypothesis Testing: Two- Sample Inference 1

STAT Section 5.8: Block Designs

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions.

Non-parametric tests, part A:

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

CHAPTER 7. Hypothesis Testing

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Analysis of 2x2 Cross-Over Designs using T-Tests

Soc 3811 Basic Social Statistics Second Midterm Exam Spring Your Name [50 points]: ID #: ANSWERS

Design of the Fuzzy Rank Tests Package

Chapter 23. Inference About Means

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Rank-Based Methods. Lukas Meier

Contents 1. Contents

Chapter 9 Inferences from Two Samples

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Experiment 2 Random Error and Basic Statistics

1 Binomial Probability [15 points]

Difference Between Pair Differences v. 2 Samples

The Chi-Square Distributions

Finansiell Statistik, GN, 15 hp, VT2008 Lecture 10-11: Statistical Inference: Hypothesis Testing

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

1 ONE SAMPLE TEST FOR MEDIAN: THE SIGN TEST

The Chi-Square Distributions

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

What is a Hypothesis?

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Non-parametric Hypothesis Testing

Chapter 12: Estimation

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47

Chapter 23: Inferences About Means

2.1 Linear regression with matrices

Statistics 251: Statistical Methods

Experiment 2 Random Error and Basic Statistics

Chapter 8: Estimating with Confidence

Biostatistics Quantitative Data

Introduction to Statistical Hypothesis Testing

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

CENTRAL LIMIT THEOREM (CLT)

Stat 231 Exam 2 Fall 2013

6.4 Type I and Type II Errors

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Transcription:

HW 6 SOLUTIONS Distribution Free Tests (two samples) 1. Fill in the blank. In our class, we apply a non parametric test when we have a small data set that does not come from a normal population. In practice, for certain shaped distributions, we might try a first, which may allow us to apply a t based inference. transformation 2. Suppose we have a dependent samples data set. Further suppose that the differences come from a normal distribution. True or False: The dependent samples t test is more powerful than the sign test. TRUE FALSE 3. Consider two samples of size n 1 = 9 and n 2 = 3. Use Table 6 to bracket the P value using a non directional alternative. a. Us = 21 P > 0.146 b. Us = 24 0.036 < P < 0.10 c. Us = 25 P = 0.036 d. Us = 28 P < 0.0091 4. 7.80. Recall the study described in exercise 7.51 on your previous homework. In this study of hypnosis, breathing patterns were observed in an experimental group of subjects and in a control group. The measurements of total ventilation (liters of air per minute per square meter of body area) are shown in the following table. Experimental Control 5.32 4.50 5.60 4.78 5.74 4.79 6.06 4.86 6.32 5.41 6.34 5.70 6.79 6.08 7.18 6.21 n 8 8 Y 6.169 5.291 s 0.621 0.652

a. When you computed a confidence interval using this data on your last homework, you were told to proceed as though the assumptions for validity of a t based inference were checked and deemed acceptable. Use the QQplot to check the normality assumption. Since we are in the independent samples setting, we must look at both QQplots. The QQplot for the experimental group is not perfect, but shows no systematic departure from linearity. The QQplot for the control group shows sytematic departure from the line. The points are making a shape (an "S" shape?). Also, we only have 8 observations in each group, so we cannot use the CLT. Conclusion: There is almost certainly a violation of the normality assumption here. t distribution inference will not be valid. b. Conduct the appropriate test to see if ventilation tends to be higher in the "to be hypnotized" group. Operate at α = 0.05. The normality assumption necessary for t based inference was not met, so we will use the non parametric test we learned when comparing independent samples the Wilcoxon Mann Whitney. (2) H 0 : Ventilation is the same for the to be hypnotized and the control conditions H A : Ventilation tends to be higher for the to be hypnotized condition (3 )K1 = 4 + 5 + 6 + 6 + 8 + 8 + 8 + 8 = 53 K2 = 0 + 0 + 0 + 0 +01 + 2 + 4 + 4 = 11 Us = max{53, 11} = 53 (4) The shift in the data is in the direction predicted by HA. With n = n' = 8, ½ (0.021) < P < ½ (0.050) 0.0105 < P < 0.025. (5) P < α, reject H0. (6) There is significant evidence to suggest that ventilation rate tends to be higher under the "to be hypnotized" condition than under the "control" condition.

5. 9.53. For an investigation of the mechanism of wound healing, a biologist chose a paired design, using the left and right hindlimbs of the salamander Notophthalmus viridescens. After amputating each limb, she made a small wound in the skin and then kept the limb for 4 hours in either a solution containing benzamil or a control solution. She theorized that the benzamil would impair the healing. The accompanying table shows the amount of healing, expressed as the area (mm 2 ) covered with new skin after 4 hours. Animal 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 Control.55.15.00.13.26.07.20.16.03.42.49.08.32.18.35.03.24 Benzamil.14.08.00.13.10.08.11.00.05.21.11.03.14.37.25.05.16 a. Use the QQplot to check the assumption of normality. Note, this is the dependent samples setting, so we need to look at the QQplot of the differences. There is a systematic departure from the line. The points are making a "U" shape, which indicates a right skew except for that one really low observation, which is not acting like the rest. Anyway, there are only 17 observations, so the CLT may not be appropriate here. There is a probable violation of the normality assumption. b. Use an appropriate hypothesis test to assess the significance of the researcher's hypothesis about the effect of Benzamil. Operate at the α = 0.05 significance level. Since there is a violation of the normality assumption and we have dependent samples, we'll use the sign test. (2) H0: Healing is the same with or without Benzamil present HA: Healing tends to be impared when Benzamil is present (3) N+ = 11, N = 4, Bs = 11 (4) Eliminating the two pairs with d = 0, nd = 15. TI 84 P = Pr{B 11} = 1 Pr{B 10} = 1 binomcdf(15, 0.5, 10) = 0.0592 (5) P > α, so fail to reject H0 (6) There is not significant evidence to conclude that healing tends to be impaired when Benzamil is present.

6. 7.84. Human beta endorphin (HBE) is a hormone secreted by the pituitary gland under conditions of stress. An exercise physiologist measured the resting (unstressed) blood concentration of HBE in two groups of men: Group 1 consisted of 11 men who had been jogging regularly for some time, and group 2 consisted of 15 men who had just entered a physical fitness program. (This data was from the same research study on HBE we used in your Inference for the Population Mean HW, exercise 6.16.) The results are given in the following table. Joggers 39 40 32 60 19 52 41 32 13 37 28 Fitness Program Entrants 70 47 54 27 31 42 37 41 9 18 33 23 49 59 41 a. Use the QQplot to check the assumption of normality for a t distribution based inference. Since we are in the independent samples setting, we need to look at the QQplots for the joggers and the fitness program entrants. The QQplot for the fitness entrants shows an ever so slight hint of being heavy tailed, but is for the most part linear. The QQplot for the joggers, on the other hand, shows a more systematic departure from the line and we only have 12 observations. Conclusion: The normality assumption is may not be met. b. Conduct the appropriate test to compare the two groups. Operate at the α = 0.05 significance level. The normality assumption necessary for t based inference may not be met, so we will use the non parametric test we learned when comparing independent samples the Wilcoxon Mann Whitney. (2) H 0 : HBE joggers = HBE fitness program entrants H A : HBE joggers HBE fitness program entrants (3 )K 1 = 93.5 K 2 = 71.5 U s = max{93.5, 71.5} = 93.5 (4) With n = 15 and n' = 11, P > 0.198 (5) P > α, fail to reject H0. (6) There is not significant evidence to conclude that resting blood concentration of HBE for joggers tends to be different from that of fitness program entrants. c. Instead of a non directional alternative, suppose the researchers believed the HBE would tend to be higher for the joggers group. What would the P value be for this test? K2 = 71.5 < 93.5 = K1 the data do not deviate in the direction of HA. P > 0.5

7. 9.33. In an investigation of possible brain damage due to alcoholism, an X ray procedure known as a computerized tomography (CT) scan was used to measure brain densities in eleven chronic alcoholics. For each alcoholic, a nonalcoholic control was selected who matched the alcoholic on age, sex, education, and other factors. The brain density measurements on the alcoholics and the matched controls are reported in the accompanying table. Pair Alcoholic Control Difference 1 40.1 41.3 1.2 2 38.5 40.2 1.7 3 36.9 37.4 0.5 4 41.4 46.1 4.7 5 40.6 43.9 3.3 6 42.3 41.9 0.4 7 37.2 39.9 2.7 8 38.6 40.4 1.8 9 38.5 38.6 0.1 10 38.4 38.1 0.3 11 38.1 39.5 1.4 a. Use the QQplot assess whether the normality assumption has been met. Note, this is the dependent samples setting, so we need to look at the QQplot of the differences. Even though it is slight, there is a systematic departure from the line. The data points are too low at both the low and high ends end of this distribution to come from a normal population. This is essentially, a skewed left distribution. Also, the CLT won't help us here since there are only 11 data points. This is a probable violation of the assumption of normality. a. Use the appropriate test to test the null hypothesis of no difference against the alternative that alcoholism reduces brain density. Let α = 0.05. (2) H0: Brain density is not different between alcoholics and healthy individuals HA: Alcoholics tend to have lower brain density (3) N+ = 2, N = 9, Bs = 9 (4) nd = 11 TI 84 P = Pr{B 9} = 1 Pr{B 8} = 1 binomcdf(11, 0.5, 8) = 0.0327 (5) P < α, so reject H0 (6) There is significant evidence to conclude that alcoholics tend to have lower brain density. b. Suppose the researcher for this study only wanted to see if there was a significant difference in brain density (i.e. a non directional alternative). What would the P value be for this test? P = 2(0.0327) = 0.0654