# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

Similar documents
AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL - MAY 2005 EXAMINATIONS STA 248 H1S. Duration - 3 hours. Aids Allowed: Calculator

Chapter 9 Inferences from Two Samples

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups

( ) P A B : Probability of A given B. Probability that A happens

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean.

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

STAT 201 Assignment 6

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

2.57 when the critical value is 1.96, what decision should be made?

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

1. Let X be a random variable with probability density function. 1 x < f(x) = 0 otherwise

Point Estimation and Confidence Interval

Inference for Proportions, Variance and Standard Deviation

December 2010 Mathematics 302 Name Page 2 of 11 pages

Lecture 41 Sections Mon, Apr 7, 2008

Chapter 9. Hypothesis testing. 9.1 Introduction

Testing a Claim about the Difference in 2 Population Means Independent Samples. (there is no difference in Population Means µ 1 µ 2 = 0) against

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

Math 10 - Compilation of Sample Exam Questions + Answers

2.4. Conditional Probability

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Math 493 Final Exam December 01

Math 2200 Fall 2014, Exam 3 You may use any calculator. You may use a 4 6 inch notecard as a cheat sheet.

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

STA 2101/442 Assignment 2 1

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

ANOVA - analysis of variance - used to compare the means of several populations.

CONTINUOUS RANDOM VARIABLES

December 2010 Mathematics 302 Name Page 2 of 11 pages

4. Suppose that we roll two die and let X be equal to the maximum of the two rolls. Find P (X {1, 3, 5}) and draw the PMF for X.

This does not cover everything on the final. Look at the posted practice problems for other topics.

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Probability and Samples. Sampling. Point Estimates

CHAPTER 10 HYPOTHESIS TESTING WITH TWO SAMPLES

1. A machine produces packets of sugar. The weights in grams of thirty packets chosen at random are shown below.

Estadística I Exercises Chapter 4 Academic year 2015/16

Psych 230. Psychological Measurement and Statistics

UNIVERSITY OF TORONTO Faculty of Arts and Science

Probability and Probability Distributions. Dr. Mohammed Alahmed

18.05 Practice Final Exam

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly.

Chapter. Hypothesis Testing with Two Samples. Copyright 2015, 2012, and 2009 Pearson Education, Inc. 1

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

Chapter 20 Comparing Groups

Exam III #1 Solutions

Two sample Hypothesis tests in R.

Sampling Distributions: Central Limit Theorem

The t-statistic. Student s t Test

QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100%

79 Wyner Math Academy I Spring 2016

Section 9 1B: Using Confidence Intervals to Estimate the Difference ( p 1 p 2 ) in 2 Population Proportions p 1 and p 2 using Two Independent Samples

EXAM 3 Math 1342 Elementary Statistics 6-7

Inferences About Two Proportions

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly.

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

WISE International Masters

Homework 10 (due December 2, 2009)

1. The (dependent variable) is the variable of interest to be measured in the experiment.

Mt. Douglas Secondary

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

QUIZ 1 (CHAPTERS 1-4) SOLUTIONS MATH 119 FALL 2012 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS

MATH 333: Probability & Statistics. Final Examination (Fall 2004) Must show all work to receive full credit. Total. Score #1 # 2 #3 #4 #5 #6 #7 #8

χ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies.

Chapter 7. Practice Exam Questions and Solutions for Final Exam, Spring 2009 Statistics 301, Professor Wardrop

Hypothesis testing: Steps

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Exercises in Probability Theory Paul Jung MA 485/585-1C Fall 2015 based on material of Nikolai Chernov

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Relax and good luck! STP 231 Example EXAM #2. Instructor: Ela Jackiewicz

Example. What is the sample space for flipping a fair coin? Rolling a 6-sided die? Find the event E where E = {x x has exactly one head}

The remains of the course

[ z = 1.48 ; accept H 0 ]

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

Exam 2 (KEY) July 20, 2009

Confidence intervals CE 311S

Sec. 11.3: Hypothesis Tests for 2 Population Means (independent samples) Sec. 11.2: Hypothesis Tests for 2. (dependent samples)

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

Single Sample Means. SOCY601 Alan Neustadtl

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

STAT 501 EXAM I NAME Spring 1999

SMAM 314 Exam 42 Name

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

Two Sample Problems. Two sample problems

Statistics and Sampling distributions

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. describes the.

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Chapter 5: HYPOTHESIS TESTING

15: CHI SQUARED TESTS

Notes for Math 324, Part 17

Statistics for IT Managers

1 Basic continuous random variable problems

Chapter 6 Continuous Probability Distributions

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Lecture 10: Comparing two populations: proportions

Transcription:

Practice Final Exam Statistical Methods and Models - Math 410, Fall 2011 December 4, 2011 You may use a calculator, and you may bring in one sheet (8.5 by 11 or A4) of notes. Otherwise closed book. The real test will have about 10 questions. Be sure when you do a hypothesis testing problem that you describe your parameters in terms of the problem when stating H 0 and H a. Also be sure you rephrase you conclusion in terms of the original question being asked in the problem. (1) You have a set of three dice. (a) Calculate the probability of getting 0, 1, 2 and 3 sixes if the dice are fair. (b) Suppose you roll the three dice 100 times. The results are # of 6s # of times 0 47 1 35 2 15 3 3 Test the null hypthesis that the dice are fair at α =.01 significance level. SOLUTION: (a) Note that if K is a binomial random variable with paramter π = 1/6 measuring the probability of a success (a 6). The probability of 0 sixes is (5/6) 3 =.5787. The probability of 1 six is ( ) 3 1 (1/6)(5/6) 2 =.3472. To get 2 sixes, ( 3 2) (1/6) 2 (5/6) =.0694 and to get 3 sixes we have probability (1/6) 3 =.0046. (b) We compute the χ 2 statistic. The table gives us observed values O 0, O 1, O 2 and O 3. We use the probabilities from the previous parts to calculated expected values by multiplying by 100. This gives χ 2 = (57.87 47)2 57.87 + (34.72 35)2 34.72 + (6.94 15)2 6.94 + (.46 3)2.46 = 25.43. (i) H 0 is that the distribution of the count of sixes be as given by the probabilities from the answer to part a. (ii) H a is that the distribution of sixes is otherwise. (iii) The significance level is.01 as given. (iv) The test statistic is χ 2 = 25.43. (v) The p-value will be the probability that (given H 0 ) χ 2 would be greater than 25.43. We have 4 classes, so 3 1

2 degrees of freedom. The associated p-value is very small (well under α). (vi) We reject H 0, concluding that the number of sixes on rolling our three dice is not represented by the probabilities described in the previous part, at the.01 significance level. (2) Males and females were asked about what they would do if they received a $100 bill by mail, addressed to their neighbor, but wrongly delivered to them. Would they return it to their neighbor? 69 men were sampled, of whom 52 said yes. 131 females were sampled, of whom 120 yes. Does the data indicate that the proportions that said yes are different for male and female at a 5% level of significance? SOLUTION:Let π 1 be the proportion of men who would return the $100 and π 2 be the proportion of women who would return the $100. p 1 = 52/69 =.7536 is the proportion of men from our sample who would return the $100. p 2 = 120/131 =.916 is the proportion of women from our sample who would return the $100. (a) H 0 : π 1 = π 2. (b) H a : π 1 π 2. (c) α =.05. (d) We make a test z statistic using our pooled population proportion p = 52+120.885.115( 1 131 + 1 69+131 = 177 200 69 ) =.0475. =.885. The standard error is z = p 1 p 2 SE =.1624.0475 = 3.4189. (e) We want P ( z z) = 2(1.99968) = 2.00032 =.00064. Since this is (much) less than α, we reject H 0 and conclude that the proportion of men who are likely to return the $100 is smaller than the proportion of women. (3) Ten students have first and second exam scores as follows: first 23 33 44 33 84 76 65 79 85 47 second 27 38 52 38 98 90 77 93 100 55. Use this data to give a 95% confidence interval for the improvement in test scores from the first to the second exams. SOLUTION: We want to estimate the mean improvement for all hypothetical students who might take be taking this course. We use the data that is the improvements in scores for the 10 students actually in this course. This data is {4, 5, 8, 5, 14, 14, 12, 14, 15, 8} The mean improvement is x = 9.1. The sample standard deviation is 4.36. We use a t statistic 9 = 10 1 df. t.975 = 2.2622. So we get 5.981 = 9.1 2.2622 4.36/ 10 µ 9.1 + 2.2622 4.36/ 10 = 12.219.

with 95% confidence. (4) Nineteen female rats are fed a specific diet between their 28th day and their 84th day. The groups were chosen randomly and 12 were fed a high protein diet while 7 were fed a low proteien diet. The mean weight gain for the 12 rats on a high protein diet was 120 grams, and the sample standard deviation is 21.39. For the rats on the low protein diet it was 101 grams with a sample standard deviation of 20.62. Construct a 95% confidence interval for the difference between weight gain on the high protein diet and weight gain on the low protein diet. SOLUTION: 120 101 = 19. The standard error for the different weight gains is SE = 21.39 2 12 + 20.622 7 = 9.94. We use 6 degrees of freedom since the smaller sample is size 7. Then t.975 = 2.4469. So 5.32 = 19 2.4469 9.94 µ 1 µ 2 19 + 2.4469 9.94 = 43.32 with 95% certainty. We could use Welch s formula to determine the degrees of freedom instead. This gives df = 13.0845, and so t.975 = 2.1604, and so 2.4744 = 19 2.1604 9.94 µ 1 µ 2 19 + 2.1604 9.94 = 40.4744 which is of course a slightly narrower range. (5) In a 1974 study, 862 Marine recruits were assigned randomly to either take daily 2 gram vitamin C supplements, or a placebos for eight weeks. During the study some recruits were removed from their platoons, and others were excluded from analysis because they didn t take their pills for the entire 8 weeks. At the end of the study there were 331 recruits in the vitamin C group, and 343 in the placebo group. The vitamin C group spend a mean of 20.3 days with a cold, with standard deviation.879 days. The placebo group spent a mean of 20.7 days with a cold, with standard deviation.642. Is there evidence at the 5% significance level that the vitamin C treatment reduces the length of cold symptoms from this study? SOLUTION: Let µ 1 be the mean number of days over an eight week period that marines who are taking the vitamin C supplement have cold symptoms. Let µ 2 be the mean number of days over an eight week period that marines who are not taking the vitamin C suppleents have colds. (a) H 0 : µ 1 = µ 2. (b) H a : µ 1 < µ 2. (c) α =.05. 3

4 (d) To calculate our test statistic, we would use a t statistic, but since n is large, we can approximate by a z statistic. Our standard error is.879 2 SE = 331 +.6422 343 =.059. So 20.3 20.7 z = = 6.7797..059 (e) The p-value is very small, so that P ( z z) < α. So we reject H 0, and conclude that vitamin C supplements change the number of days with cold symptoms. (6) Use a one-sided χ 2 contingency table at the α =.05 level to test whether acupuncture is more successful than placebo using the following table of data: Success Failure Total Acupuncture 7 6 13 Placebo 4 13 17 Total 11 19 30 SOLUTION: Our table of expected counts is Success Failure Total Acupuncture 4.77 8.23 13 Placebo 6.23 10.77 17 Total 11 19 30 (a) H 0 : We see the same percentage of failures and successes in people treated with acupuncture as in people treated with a placebo. (b) H a : We see a higher percentage of successes in the acupuncture treated group. (c) α =.05. (d) Our test statistic is χ 2 (7 4.77)2 (4 6.23)2 = + + 4.77 6.23 (e) Our p-value is (6 8.23)2 8.23 + (13 10.77)2 10.77 = 2.90673. 1 2 P (χ2 2.907) < 1 2 P (χ2 2.7055) = 1.1 =.05. 2 (f) Since the p-value is less than α, we reject H 0 and conclude that acupuncture is more effective than placebo treatment. (7) Suppose independent random samples from two normal populations gave the following results: n X = 25, x = 16, s x = 4.7 n Y = 45, y = 20, s y = 2.3.

Find a conservative.95 confidence interval for the difference µ X µ Y. SOLUTION: x y = 4. The standard error is 4.7 2 SE = 25 + 2.32 45 = 1.00058. We use 24 df and t.975 = 2.0639. So we get 6.0651 = 4 2.0639 1.00058 µ X µ Y 4+2.0639 1.00058 = 1.9349/ 5 (8) How large a sample is needed to obtain an estimate of the mean IQ of college students to within a 95% margin of error of 1 point? (Use σ = 15.) SOLUTION: Let x be our sample mean, and n be the sample size. Then using z.975 = 1.96, we get x 1.96 15/ n µ x + 1.96 15/ n. We want 1.96 15/ n 1. This is the same as 1.96 15 n, which is in turn the same as 1.96 2 225 n. So we need n at least 865 since 1.96 2 225 = 864.36. (9) A professional basketball player used to make 55% of his free throws. His agent claims he has improved, and offers as evidence that he has made 40 out of his last 60 free throws. Test the hypothesis at the.05 significance level that his free throw percentage has improved from 55%. SOLUTION: Our sample proportion is p = 2/3 and our sample size is 60. Let π be the percentage of free throws our basketball player is now making. (a) H 0 : π =.55. (b) H a : π >.55. (c) α =.05. (d) Our test statistic is 40 successes in our sample. The associated z-statistic is 40.55 60 z = = 1.8165. 60.55.45 (e) Our P -value is P (z 1.8165) = (1.965) =.035. (f) The P -value is less than α so we reject H 0 and conclude that the basketball player s freethrow rate is now above 55%. Note that this is contingent on treating his last 60 throws as a random sample of his current ability. This is open to question. (10) Assume an airline flies a plane whose safe passenger load is 16000 pounds. Assume the passengers weigh 190 pounds on average (overall) with a standard deviation of 40 pounds, and that on each flight we have a random sample of 80 passengers. How often will the plane be overloaded?

6 SOLUTION:We assume that the weight of passengers is approximately normal. We are asking how likely 80 passengers are to weigh more than 16000 pounds. The meain weight of 80 passengers is 80 190 = 15200, and the standard deviation is 8040 = 357.71. 16000 = 15200 + 800 = 15200 + 2.236 357.71. So we have to ask for the probability that the total weight is more than 2.236 standard deviations about the mean. Since the distribution is normal, this probability is about.013. (11) Compute µ, E(X 2 ) and σ for the random variable X with pdf { x 1 x 1 f(x) = 0 else. SOLUTION: µ = E(X) = E(X 2 ) = 0 1 0 1 x 2 dx + x 3 dx + 1 0 1 0 x 2 dx = 0. x 3 dx = 2 4 =.5. The variance is E(X 2 ) E(X) 2 =.5, so the standard deviation is 1/ 2. (12) Compute µ, E(X 2 ) and σ for the random variable X with pdf f(x) = We assume a < b here. SOLUTION: µ = E(X) = E(X 2 ) = The variance is b a b E(X 2 ) E(X) 2 = b2 + ab + a 2 3 a { 1 b a a x b 0 else. x b a dx = b2 a 2 2(b a) = b + a 2. x 2 b a dx = b3 a 3 3(b a) = b2 + ab + a 2. 3 b2 + 2ab + a 2 4 = b2 12 2ab 12 + a2 12 (a b)2 =. 12 Thus the standard deviation is (b a)/2 3. (13) The length of time necessary to perform a specific manufacturing operation is a random variable T with mean 12 and standard deviation 2 minutes. Use Chebyshev s Inequality to find a lower bound for P (6 < T < 18). SOLUTION:We are looking at values within 3 standard deviations of the mean. So we are asking for P ( 3 < Z < 3) > 1 1 9 = 8 9.

(14) An urn contains 50 black balls and 10 white balls. You pick 10 balls at random. Give a formula for the probability of picking all black balls. SOLUTION: This is 50 49 48 60 59 58 41 51 = 50!50! 40!60! This is about 14%. (15) Give the definition of E being independent of F. Prove that if E is independent of F then F is independent of E. SOLUTION: E is independent of F means P (E) = P (E F ). The definition of P (E F ) is P (E F )/P (F ). So E independent of F means P (E) = P (E F ) = P (E F ). P (F ) To show F is independent of E, we want to show P (F ) = P (F E). We take the equation above and multiply by P (F ) and then divide by P (E). We get then P (F ) = P (E F ) P (E) = P (F E) which is what we want to prove. (16) State Bayes s Theorem. A test for a particular cancer has sensitivity 2/3, specificity.91. Prevalence among the relevant population is.015. Calculate the predictive value positive. SOLUTION: (a) Bayes s Theorem: P (E)P (F E) P (E F ) = P (E)P (F E) + P (E)P (F E) (b) This is the probability of the disease if the test is positive. P V + P (disease)p (positive disease) = P (disease positive) = P (disease)p (positive disease) + P (no disease)p (positive no disease).015 (2/3) =.015 (2/3) +.985.09 =.01.01 +.08865 =.101368. (17) Suppose π =.55 (a population proportion). But you don t know π and you are testing the hypothesis π =.5 against the hypothesis π >.5. Your procedure will be to pick a random sample of 20 individuals and do a z-test at the.02 significance level. What is the power of this test? SOLUTION: The power of the test tells us how likely we are to reject H 0 when H 0 is false. So o calculate the power here (as H 0 is indeed false) we need the probability of rejecting H 0. We reject H 0 if P (z z).02. That corresponds to z 2.0537. 7

8 If K is the number of successes in our sample of 20, then under the hypothesis H 0 : π =.5, we ll have z = K 10 20.5.5 So z 2.0537 corresponds to K 10 2.0537 that is K 10 4.5922. 2.23607 So we ll reject H 0 when K 15. Given π =.55, we now need to calculate the probability that K 15. We could do this directly, or by using the normal approximation with the continuiuty correction. We ll do it directly: 20 ( ) 20 P (K 15) =.55 i.45 20 i =.055. i i=15