Homework 5 Solutions

Similar documents
A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

PRACTICE PROBLEMS FOR THE FINAL

Simulation. Two Rule For Inverting A Distribution Function

Chapter 8: Estimating with Confidence

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Central Limit Theorem the Meaning and the Usage

Data Analysis and Statistical Methods Statistics 651

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

Statistics 511 Additional Materials

Understanding Samples

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Confidence Intervals for the Population Proportion p

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Topic 9: Sampling Distributions of Estimators

6.3 Testing Series With Positive Terms

Final Review for MATH 3510

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

Random Variables, Sampling and Estimation

Expectation and Variance of a random variable

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.

Math 10A final exam, December 16, 2016

Math 140 Introductory Statistics

( 1) n (4x + 1) n. n=0

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Chapter 23: Inferences About Means

Introduction There are two really interesting things to do in statistics.

STAT 515 fa 2016 Lec Sampling distribution of the mean, part 2 (central limit theorem)

Stat 400: Georgios Fellouris Homework 5 Due: Friday 24 th, 2017

Chapter 18 Summary Sampling Distribution Models

Binomial Distribution

Chapter 18: Sampling Distribution Models

Stat 200 -Testing Summary Page 1

Statistical Inference Procedures

Chapter 5: Hypothesis testing

CONFIDENCE INTERVALS STUDY GUIDE

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Exercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).

Confidence Intervals QMET103

Read through these prior to coming to the test and follow them when you take your test.

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,

Sample Size Determination (Two or More Samples)

Eco411 Lab: Central Limit Theorem, Normal Distribution, and Journey to Girl State

Discrete Mathematics and Probability Theory Fall 2016 Walrand Probability: An Overview

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

The standard deviation of the mean

AP Statistics Review Ch. 8

BHW #13 1/ Cooper. ENGR 323 Probabilistic Analysis Beautiful Homework # 13

( ) = p and P( i = b) = q.

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Estimation for Complete Data

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Topic 9: Sampling Distributions of Estimators

Random Models. Tusheng Zhang. February 14, 2013

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

1 Review of Probability & Statistics

Riemann Sums y = f (x)

(7 One- and Two-Sample Estimation Problem )

1 Inferential Methods for Correlation and Regression Analysis

Estimation of a population proportion March 23,

Solutions. Discussion D1. a. The middle 95% of a sampling distribution for a binomial proportion ˆp is cut off by the two points

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Section 11.8: Power Series

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

MATH/STAT 352: Lecture 15

PRACTICE PROBLEMS FOR THE FINAL

Topic 9: Sampling Distributions of Estimators

1 Models for Matched Pairs

Conditional Probability. Given an event M with non zero probability and the condition P( M ) > 0, Independent Events P A P (AB) B P (B)

Frequentist Inference

CH19 Confidence Intervals for Proportions. Confidence intervals Construct confidence intervals for population proportions

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion

Stat 225 Lecture Notes Week 7, Chapter 8 and 11

CS 330 Discussion - Probability

f X (12) = Pr(X = 12) = Pr({(6, 6)}) = 1/36

Randomized Algorithms I, Spring 2018, Department of Computer Science, University of Helsinki Homework 1: Solutions (Discussed January 25, 2018)

Math 113 Exam 3 Practice

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

4.3 Growth Rates of Solutions to Recurrences

32 estimating the cumulative distribution function

Transcription:

Homework 5 Solutios p329 # 12 No. To estimate the chace you eed the expected value ad stadard error. To do get the expected value you eed the average of the box ad to get the stadard error you eed the stadard deviatio of the box. You ca do it without the actual umbers as log as you have the average ad the stadard deviatio. Sice the umbers are bouded betwee -10 ad 10 you ca assume that with 1000 draws the probability histogram for the sum will be approximately ormal. To use the ormal distributio you eed to kow the expected value ad the SD of the the ormal distributio (also called the SE). The expected value of the sum of 1000 draws is just 1000(AVG of the box). The SD for the sum is 1000(SD of the box). So you do have eough iformatio. p352 # 11 I this questio the box that the car is i is fixed, sice the car has already bee hidde. Suppose the car is i box B. Let X i be a radom variable that couts whether the i-th coi flip is heads. The we are lookig at the sum B+X 1 +X 2 + +X 100. So if the sum is 65 the we have 65 = B +X 1 + +X 100 so we ca solve for B to get B = 65 (X 1 +X 2 + +X 100 ). Now we are lookig for the probability histogram for this sum. We ca compute the expected value: 1

E(65 (X 1 + + X 100 )) = 65 E(X 1 + + X 100 ) = 65 100E(X 1 ) = 65 50 = 15 Sice this sum is goig to follow a ormal distributio (by the Cetral Limit Theorem) the Porsche would be most likely to be i the boxes closest to the expected value. So you should look i 10,11,12,13,14,15,16,17,18,19 ad 20. Here the expected value for B is computed just like i the last questio, which comes out to be 45. So you would look i boxes 40, 41, 42, 43, 44, 45, 46, 47, 48, 49 ad 50. Part (c) I geeral if the sum is N you would look i boxes N 55 to N 45. Part (d) To figure out the chaces you eed the SD. Remember, that the SD is ot chaged whe we add a costat to each elemet i a list. So we ca compute the SD as follows: SD(65 (X 1 + + X 100 )) = 65 SD(X 1 + + X 100 ) = 100SD(X 1 ) = 100 1 2 = 5 The chace the would be the area uder the ormal curve from N 55.5 to N 44.5. I SU these poits become 1.1 ad 1.1. So the area uder the curve, ad thus the chace to get the Porsche, is 72.87%. p371 # 1 Number of heads Percet of heads Number of tosses Expected value SE Expected value SE 100 50 5 50% 5% 2,500 1,250 25 50% 1% 10,000 5,000 50 50% 0.5% 1,000,000 500,000 500 50% 0.05% 2

p372 # 5 We are lookig at the sum of 50 radom guests. Let X i deote the weight of the i-th guest. The we are tryig to estimate the sum X 1 + + X 50. We ca compute the expected value ad the SD as usual: E(X 1 + + X 50 ) = 50E(X 1 ) = 50(150) = 7500 SD(X 1 + + X 50 ) = 50SD(X 1 ) = 50(35) 247.5 Now 4 tos is 8000 pouds, so we are iterested i the chace that this sum will be over 8000. I SU 8000 is 8000 7500 2, so we are lookig for the area to the right of 2 i the 247.5 ormal distributio. This is about 2.5%. Souds like the tower of terror :P p383 # 3 The expected value i 100 draws is 1 red marble. (E(X 1 + +X 100 ) = 100E(X 1 ) = 100 1 The SE is: SE = SD(X 1 + + X 100 ) = 1 99 100 100 100 1 You ca t draw fewer tha 0 red marbles, so the chace is 0. Part (c) The area i the ormal approximatio that correspods to drawig less tha 0 red marbles is the area left of 0.5. I SU this is 0.5 1 = 1.5. This is 6.68%. (The book gets its 1 aswer by usig 0 istead of 0.5, but here it is importat to pay attetio to the edpoits, because it chages the aswer so much sice the SD ad expected value are so small.) Part (d) No. There have t bee eough draws to make the ormal curve look like the probability histogram. There are two factors that affect whether the ormal curve approximates the probability histogram; how ubalaced the box is ad how may draws have happeed. If the box is really ubalaced like this oe you eed more draws for the probability histogram to be ormal. I this case with 10000 draws it would be much closer. 100 ). 3

p383 # 4 False. That is how you would regularly compute the cofidece iterval, but as is show i the last problem, the ormal curve does ot approximate the probability histogram well here. So we ca t compute the 95% cofidece iterval i this way :( p391 # 2 Here you just compute the percet i the sample, which is also 47%. It is likely to be off by (0.47)(0.53) 1 SE, which is 2.2%. 500 The previous sample is a simple radom sample, so a 95% cofidece iterval ca be computed as usual. We computed the SE i the previous step, so a 95% cofidece iterval is 47% ± 4.4%. p392 # 4 I the households iterviewed 379 had oe or more cars. This is 75.8%. The SE is (0.758)(0.242) = 1.9%. 500 p393 # 13 I this box P(1) = 0.25, P(2) = 0.5 ad P(5) = 0.25. So the probability histogram is (ii). The probability histogram for the sum of 100 draws should look like a ormal distributio accordig to the Cetral Limit Theorem, so it is (iii). (i) is irrelevat. Worksheet 1 We kow their wiig proportio is curretly 0.375. So the SE for this after 16 games is (0.375)(0.625) 16 = 0.121. The 95% cofidece iterval is just the observed proportio ± 2SE. This is 0.375 ± 0.242. 4

The 99.7% cofidece iterval is the observed proportio ±3SE, which is 0.375 ± 0.363. Part (c) Fially a 50% cofidece iterval is the observed proportio ±0.68SE, which is 0.375±0.082. Worksheet 2 This questio is a tricky oe. It is easiest if we thik about it like questio 4 from last week. Let X i be a radom variable that couts whether or ot the Red Sox wo the i-th game ad similarly let Y i cout wis for the Yakees. Now let Z i = Y i X i. We are iterested i how may games we eed to play for a 95% cofidece iterval for the variable Z 1+ +Z to ot iclude 0. This meas that we wat 0 = E( Z 1+ +Z ) 2SD( Z 1+ +Z ). The first term ca be rewritte: E( Z 1 + + Z ) = E( (Y 1 X 1 ) + + (Y X ) ) = E( Y 1 + + Y ) E( X 1 + + X ) = 0.500 0.375 = 0.125 Ufortuately the SD is a foul beast. First we compute SD(Z 1 ). Z 1 has outcomes -1, 0 ad 1. -1 happes whe the Yakees lose ad the Red Sox wi, which happes with probability (0.5)(0.375) = 0.1875. 0 happes whe they both wi or both lose, which happes with probability (0.5)(0.375)+(0.5)(0.625) = 0.5. Fially 1 happes whe the Yakees wi ad the Sox lose, which happes with probability (0.5)(0.625) = 0.3125. So ow we ca compute the expected value for Z 1 : E(Z 1 ) = 1(0.1875) + 0(0.5) + 1(0.3125) = 0.125 Note: there is a much easier way to compute this. E(Z 1 ) = E(Y 1 X 1 ) = E(Y 1 ) E(X 1 ) = 0.5 0.375 = 0.125. Now we ca compute the SD... ugh :( Z 1 has outcomes -1, 0 ad 1, so (Z 1 E(Z 1 )) 2 has outcomes ( 1.125) 2 = 1.26563, ( 0.125) 2 = 0.015625 ad 0.875 2 = 0.765625. So SD(Z 1 ) = E((Z 1 E(Z 1 )) 2 ) = 1.26563(0.1875) 0.015625(0.5) + 0.765625(0.3125) = 0.484375 = 0.695971. Ok, ow we ca solve the problem. We are iterested i the differeces of their wiig percetages. That is give by the value Z 1+ +Z. Now we compute the expected value ad 5

SD of this: E( Z 1 + + Z ) = E(Z 1 +... Z ) = E(Z 1) +... E(Z ) = E(Z 1) = E(Z 1 ) = 0.125 SD( Z 1 + + Z ) = SD(Z 1 + + Z ) SD(Z1 ) = = SD(Z 1) = 0.695971 So after games the 95% cofidece iterval for the differece of the wiig percetages is 0.125 ± 2 0.695971. We are iterested i whe the left edpoit of this iterval is 0, i.e. 0 = 0.125 1.39. We solve this for ad get = 124. Whew! Worksheet 3 If they keep wiig at a.375 rate the their 95% cofidece iterval would be.375 ± (0.375)(0.625) 2. So we wat to solve for whe the right edpoit for this iterval is 0.5. This happes whe = 60. Worksheet 4 For the Red Sox to wi 90 games they would eed to wi 84 out of the remaiig 146 which is a proportio of 0.575. So if their true wiig proportio is this or less I will lose. Right ow we have a sample of 16 games. For this sample the observed wiig proportio is 0.375 with a SE of 0.121. I SU 0.5 becomes 0.575 0.375 = 1.65. So we are lookig for the area to 0.121 the left of 1.65 o the ormal distributio, which is about 95%. Thus we ca say that the Red Sox s true wiig percetage is less tha 0.575 with 95% cofidece. For the bet this meas that the expected value of the bet is 1000(0.05) 2(0.95) = 48.1. So I should take it! 6

Worksheet 5 After 80 games the Red Sox are still wiig at a rate of 0.375. Now our SE is chaged (0.375)(0.625) though; it is = 0.05. To get a 99.99% cofidece iterval we eed to take 3.9 80 SE s, which gives us the iterval 0.375 ± 0.195. Worksheet 6 First let s compute our cofidece that the Red Sox wiig percetage is greater tha 0.500. This is the area to the right of 0.500 i SU, which is 0.500 0.375 = 2.5. The area is the 0.62%. 0.05 (This meas that i oly 0.62% of samples is the true proportio 2.5 SEs greater tha the sample proportio.) Now we compute the same thig for the Yakees. First their wiig percetage is 46 (0.575)(0.425) 0.575. Now, their SE is = 0.06. So i SU 0.5 is 0.500 0.575 = 1.25. So, i 80 0.06 the Yakees case we are 10.6% cofidet that their true wiig percetage is less tha 0.500. These two evets are idepedet, so to fid our cofidece that both would occur at the same time we ca just multiply our cofidece i each evet. So we are 100(0.0062)(0.106) = 0.07% cofidet that the Yakees wiig percetage is less tha 50% ad the Red Sox s is greater tha 50%. 80 = 7