Class 26: review for final exam 18.05, Spring 2014

Similar documents
Practice Exam 1: Long List 18.05, Spring 2014

Class 8 Review Problems 18.05, Spring 2014

Review for Exam Spring 2018

This does not cover everything on the final. Look at the posted practice problems for other topics.

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

Post-exam 2 practice questions 18.05, Spring 2014

Class 8 Review Problems solutions, 18.05, Spring 2014

Central Limit Theorem and the Law of Large Numbers Class 6, Jeremy Orloff and Jonathan Bloom

18.05 Practice Final Exam

Confidence Intervals for Normal Data Spring 2014

Exam 2 Practice Questions, 18.05, Spring 2014

STAT 414: Introduction to Probability Theory

18.05 Exam 1. Table of normal probabilities: The last page of the exam contains a table of standard normal cdf values.

STAT 418: Probability and Stochastic Processes

HST.582J / 6.555J / J Biomedical Signal and Image Processing Spring 2007

Midterm Exam 1 Solution

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Quick Tour of Basic Probability Theory and Linear Algebra

MAT 271E Probability and Statistics

1 Basic continuous random variable problems

Exam 1 Practice Questions I solutions, 18.05, Spring 2014

MAT 271E Probability and Statistics

Continuous Expectation and Variance, the Law of Large Numbers, and the Central Limit Theorem Spring 2014

HW1 (due 10/6/05): (from textbook) 1.2.3, 1.2.9, , , (extra credit) A fashionable country club has 100 members, 30 of whom are

Chapter 2 Class Notes

1 Basic continuous random variable problems

PRACTICE PROBLEMS FOR EXAM 2

Name: Exam 2 Solutions. March 13, 2017

14.30 Introduction to Statistical Methods in Economics Spring 2009

Deep Learning for Computer Vision

MATH 3510: PROBABILITY AND STATS July 1, 2011 FINAL EXAM

6.041/6.431 Spring 2009 Quiz 1 Wednesday, March 11, 7:30-9:30 PM. SOLUTIONS

Recitation 2: Probability

Lecture 2: Repetition of probability theory and statistics

Math Review Sheet, Fall 2008

Discrete Distributions

Chapter 2: The Random Variable

Frequentist Statistics and Hypothesis Testing Spring

What is a random variable

MATH 3C: MIDTERM 1 REVIEW. 1. Counting

Compute f(x θ)f(θ) dθ

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Lecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019

Comparison of Bayesian and Frequentist Inference

Summary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016

Notes 12 Autumn 2005

ECE 302, Final 3:20-5:20pm Mon. May 1, WTHR 160 or WTHR 172.

If S = {O 1, O 2,, O n }, where O i is the i th elementary outcome, and p i is the probability of the i th elementary outcome, then

1 Preliminaries Sample Space and Events Interpretation of Probability... 13

Learning Objectives for Stat 225

Math 218 Supplemental Instruction Spring 2008 Final Review Part A

Conditional Probability

Each trial has only two possible outcomes success and failure. The possible outcomes are exactly the same for each trial.

4 Lecture 4 Notes: Introduction to Probability. Probability Rules. Independence and Conditional Probability. Bayes Theorem. Risk and Odds Ratio

STAT 4385 Topic 01: Introduction & Review

AP Statistics Cumulative AP Exam Study Guide

Confidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom

Bayesian Updating with Continuous Priors Class 13, Jeremy Orloff and Jonathan Bloom

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

Dept. of Linguistics, Indiana University Fall 2015

The probability of an event is viewed as a numerical measure of the chance that the event will occur.

1 The Basic Counting Principles

EECS 126 Probability and Random Processes University of California, Berkeley: Fall 2014 Kannan Ramchandran September 23, 2014.

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices.

Notice how similar the answers are in i,ii,iii, and iv. Go back and modify your answers so that all the parts look almost identical.

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability

Chapter 5. Chapter 5 sections

Covariance and Correlation Class 7, Jeremy Orloff and Jonathan Bloom

Math/Stats 425, Sec. 1, Fall 04: Introduction to Probability. Final Exam: Solutions

Week 2. Section Texas A& M University. Department of Mathematics Texas A& M University, College Station 22 January-24 January 2019

Counting principles, including permutations and combinations.

Name: 180A MIDTERM 2. (x + n)/2

MATH 3510: PROBABILITY AND STATS June 15, 2011 MIDTERM EXAM

Lecture 10. Variance and standard deviation

Part 3: Parametric Models

Chapter 7: Section 7-1 Probability Theory and Counting Principles

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

CS37300 Class Notes. Jennifer Neville, Sebastian Moreno, Bruno Ribeiro

Random Variables and Their Distributions

Lecture 1: Probability Fundamentals

Probability, For the Enthusiastic Beginner (Exercises, Version 1, September 2016) David Morin,

Math 493 Final Exam December 01

ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables

Chapter 3 Discrete Random Variables

MAT 271E Probability and Statistics

Probability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces.

CSE 312: Foundations of Computing II Quiz Section #10: Review Questions for Final Exam (solutions)

CME 106: Review Probability theory

Random variables, distributions and limit theorems

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

Conditional Probability, Independence and Bayes Theorem Class 3, Jeremy Orloff and Jonathan Bloom

Lecture Notes 2 Random Variables. Discrete Random Variables: Probability mass function (pmf)

Problems and results for the ninth week Mathematics A3 for Civil Engineering students

Exercises in Probability Theory Paul Jung MA 485/585-1C Fall 2015 based on material of Nikolai Chernov

BINOMIAL DISTRIBUTION

Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions

1 Presessional Probability

STAT Chapter 5 Continuous Distributions

STA 4321/5325 Solution to Extra Homework 1 February 8, 2017

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations

Transcription:

Probability Class 26: review for final eam 8.05, Spring 204 Counting Sets Inclusion-eclusion principle Rule of product (multiplication rule) Permutation and combinations Basics Outcome, sample space, event Discrete, continuous Probability function Conditional probability Independent events Law of total probability Bayes theorem Random variables Discrete: general, uniform, Bernoulli, binomial, geometric Continuous: general, uniform, normal, eponential pmf, pdf, cdf Epectation = mean = average value Variance; standard deviation Joint distributions Joint pmf and pdf Independent random variables Covariance and correlation Central limit theorem Statistics Maimum likelihood Least squares Bayesian inference Discrete sets of hypotheses Continuous ranges of hypotheses Beta distributions Conjugate priors Choosing priors Probability intervals Frequentist inference NHST: rejection regions, significance NHST: p-values z, t, χ 2 NHST: type I and type II error NHST: power Confidence intervals Bootstrap confidence intervals

Class 26: review for final eam, Spring 204 2 Empirical bootstrap confidence intervals Parametric bootstrap confidence intervals Linear regression Sets and counting Sets:, union, intersection, complement Venn diagrams, products Counting: inclusion-eclusion, rule of product, ( permutations n P k, combinations n C k = n ) k Problem. Consider the nucleotides A, G, C, T. (a) How many ways are there to make a sequence of 5 nucleotides. (b) How many sequences of length 5 are there where no adjacent nucleotides are the same (c) How many sequences of length 5 have eactly one A? Problem 2. (a) How many 5 card poker hands are there? (b) How many ways are there to get a full house (3 of one rank and 2 of another)? (c) What s the probability of getting a full house? Problem 3. (a) How many arrangements of the letters in the word probability are there? (b) Suppose all of these arrangements are written in a list and one is chosen at random. What is the probability it begins with b. Probability Sample space, outcome, event, probability function. Rule: P (A B) = P (A)+P (B) P (A B). Special case: P (A c ) = P (A) (A and B disjoint P (A B) = P (A) + P (B).) Conditional probability, multiplication rule, trees, law of total probability, independence Bayes theorem, base rate fallacy Problem 4. Let E and F be two events. Suppose the probability that at least one of them occurs is 2/3. What is the probability that neither E nor F occurs? Problem 5. Let C and D be two events with P (C) = 0.3, P (D) = 0.4, and P (C c D) = 0.2. What is P (C D)?

Class 26: review for final eam, Spring 204 3 Problem 6. Suppose we have 8 teams labeled T,..., T 8. Suppose they are ordered by placing their names in a hat and drawing the names out one at a time. (a) How many ways can it happen that all the odd numbered teams are in the odd numbered slots and all the even numbered teams are in the even numbered slots? (b) What is the probability of this happening? Problem 7. Suppose you want to divide a 52 card deck into four hands with 3 cards each. What is the probability that each hand has a king? Problem 8. Suppose we roll a fair die twice. Let A be the event the sum of the rolls is 5 and let B be the event at least one of the rolls is 4. (a) Calculate P (A B). (b) Are A and B independent? Problem 9. On a quiz show the contestant is given a multiple choice question with 4 options. Suppose there is a 70% chance the contestant actually knows the answer. If they don t know the answer they guess with a 25% chance of getting it right. Suppose they get it right. What is the probability that they were guessing? Problem 0. Suppose you have an urn containing 7 red and 3 blue balls. You draw three balls at random. On each draw, if the ball is red you set it aside and if the ball is blue you put it back in the urn. What is the probability that the third draw is blue? (If you get a blue ball it counts as a draw even though you put it back in the urn.) Problem. Independence Suppose that P (A) = 0.4, P (B) = 0.3 and P ((A B) C ) = 0.42. Are A and B independent? Problem 2. Suppose that events A, B and C are mutually independent with P (A) = 0.3, P (B) = 0.4, P (C) =. Compute the following: (Hint: Use a Venn diagram) (i) P (A B C c ) (ii) P (A B c C) (iii) P (A c B C) Problem 3. Suppose A and B are events with 0 < P (A) < and 0 < P (B) <. (a) If A and B are disjoint, can they be independent? (b) If A and B are independent, can they be disjoint? Random variables, epectation and variance Discrete random variables: events, pmf, cdf

Class 26: review for final eam, Spring 204 4 Bernoulli(p), binomial(n, p), geometric(p), uniform(n) E(X), meaning, algebraic properties, E(h(X)) Var(X), meaning, algebraic properties Continuous random variables: pdf, cdf uniform(a,b), eponential(λ), normal(µ,σ) Transforming random variables Quantiles Problem 4. Directly from the definitions of epected value and variance, compute E(X) and Var(X) when X has probability mass function given by the following table: X -2-0 2 p(x) /5 2/5 3/5 4/5 5/5 Problem 5. (Epected value and variance) Suppose that X takes values between 0 and and has probability density function 2. Compute Var(X) and Var(X 2 ). Problem 6. The pmf of X is given by the following table Value of X - 0 Probability /3 /6 /2 (a) Compute E(X). (b) Give the pdf of Y = X 2 and use it to compute E(Y ). (c) Instead, compute E(X 2 ) directly from an etended table. (d) Compute Var(X). Problem 7. Compute the epectation and variance of a Bernoulli(p) random variable. Problem 8. Suppose 00 people all toss a hat into a bo and then proceed to randomly pick out a hat. What is the epected number of people to get their own hat back. Hint: epress the number of people who get their own hat as a sum of random variables whose epected value is easy to compute. pmf, pdf, cdf Probability Mass Functions, Probability Density Functions and Cumulative Distribution Functions Problem 9. Y = 2X. Suppose that X Bin(n, ). Find the probability mass function of

Class 26: review for final eam, Spring 204 5 Problem 20. Suppose that X is uniform on [0, ]. Compute the pdf and cdf of X. If Y = 2X + 5, compute the pdf and cdf of Y. Problem 2. Now suppose that X has probability density function f X () = λe λ for 0. Compute the cdf, F X (). If Y = X 2, compute the pdf and cdf of Y. Problem 22. Suppose that X is a random variable that takes on values 0, 2 and 3 with probabilities 0.3, 0., 0.6 respectively. Let Y = 3(X ) 2. (a) What is the epectation of X? (b) What is the variance of X? (c) What is the epection of Y? (d) Let F Y (t) be the cumulative density function of Y. What is F Y (7)? Problem 23. Suppose you roll a fair 6-sided die 25 times (independently), and you get $3 every time you roll a 6. Let X be the total number of dollars you win. (a) What is the pmf of X. (b) Find E(X) and Var(X). (c) Let Y be the total won on another 25 independent rolls. Compute and compare E(X + Y ), E(2X), Var(X + Y ), Var(2X). Eplain briefly why this makes sense. Problem 24. (Continuous random variables) A continuous random variable X has PDF f() = + a 2 on [0,] Find a, the CDF and P (.5 < X < ). Problem 25. For each of the following say whether it can be the graph of a cdf. If it can be, say whether the variable is discrete or continuous. (i) (ii) (iii) (iv)

Class 26: review for final eam, Spring 204 6 (v) (vi) (vii) (viii) Distributions with names Problem 26. (Eponential distribution) Suppose that buses arrive are scheduled to arrive at a bus stop at noon but are always X minutes late, where X is an eponential random variable with probability density function f X () = λe λ. Suppose that you arrive at the bus stop precisely at noon. (a) Compute the probability that you have to wait for more than five minutes for the bus to arrive. (b) Suppose that you have already waiting for 0 minutes. Compute the probability that you have to wait an additional five minutes or more. Problem 27. Normal Distribution: Throughout these problems, let φ and Φ be the pdf and cdf, respectively, of the standard normal distribution Suppose Z is a standard normal random variable and let X = 3Z +. (a) Epress P (X ) in terms of Φ (b) Differentiate the epression from (a) with respect to to get the pdf of X, f(). Remember that Φ (z) = φ(z) and don t forget the chain rule (c) Find P ( X ) (d) Recall that the probability that Z is within one standard deviation of its mean is approimately 68%. What is the probability that X is within one standard deviation of its mean? Problem 28. Transforming Normal Distributions Suppose Z N(0,) and Y = e Z. (a) Find the cdf F Y (a) and pdf f Y (y) for Y. (For the CDF, the best you can do is write it in terms of Φ the standard normal cdf.) (b) We don t have a formula for Φ(z) so we don t have a formula for quantiles. So we have to write quantiles in terms of Φ. (i) Write the.33 quantile of Z in terms of Φ

Class 26: review for final eam, Spring 204 7 (ii) Write the.9 quatntile of Y in terms of Φ. (iii) Find the median of Y. Problem 29. (Random variables derived from normal r.v.) Let X, X 2,... X n be i.i.d. N(0, ) random variables. Let Y 2 n = X +... + X 2 n. (a) Use the formula Var(X j ) = E(X 2 j ) E(X j) 2 to show E(X 2 j ) =. (b) Set up an integral in for computing E(X 4 j ). For 3 etra credit points, use integration by parts show E(X 4 j ) = 3. (If you don t do this, you can still use the result in part c.) (c) Deduce from parts (a) and (b) that Var(X 2 j ) = 2. (d) Use the Central Limit Theorem to approimate P (Y 00 > 0). Problem 30. More Transforming Normal Distributions (a) Suppose Z is a standard normal random variable and let Y = az + b, where a > 0 and b are constants. Show Y N(b, a 2 ). (b) Suppose Y N( σ 2 Y µ, ). Show Problem 3. (Sums of normal random variables) µ follows a standard normal distribution. σ Let X be independent random variables where X N(2, 5) and Y N(5, 9) (we use the notation N(µ, σ 2 )). Let W = 3X 2Y +. (a) Compute E(W ) and Var(W ). (b) It is known that the sum of independent normal distributions is normal. Compute P (W 6). Problem 32. Let X U(a, b). Compute E(X) and Var(X). Problem 33. In n + m independent Bernoulli(p) trials, let S n be the number of successes in the first n trials and T m the number of successes in the last m trials. (a) What is the distribution of S n? Why? (b) What is the distribution of T m? Why? (c) What is the distribution of S n + T m? Why? (d) Are S n and T m independent? Why? Problem 34. Compute the median for the eponential distribution with parameter λ. Joint distributions

Class 26: review for final eam, Spring 204 8 Joint pmf, pdf, cdf. Marginal pmf, pdf, cdf Covariance and correlation. Problem 35. Correlation Flip a coin 3 times. Use a joint pmf table to compute the covariance and correlation between the number of heads on the first 2 and the number of heads on the last 2 flips. Problem 36. Correlation Flip a coin 5 times. Use properties of covariance to compute the covariance and correlation between the number of heads on the first 3 and last 3 flips. Problem 37. Toss a fair coin 3 times. Let X = the number of heads on the first toss, Y the total number of heads on the last two tosses, and Z the number of heads on the first two tosses. (a) Give the joint probability table for X and Y. Compute Cov(X, Y ). (b) Give the joint probability table for X and Z. Compute Cov(X, Z). Problem 38. Let X be a random variable that takes values -2, -, 0,, 2; each with probability /5. Let Y = X 2. (a) Fill out the following table giving the joint frequency function for X and Y. Be sure to include the marginal probabilities. (b) Find E(X) and E(Y ). (c) Show X and Y are not independent. X -2-0 2 total Y 0 4 total (d) Show Cov(X, Y ) = 0. This is an eample of uncorrelated but non-independent random variables. The reason this can happen is that correlation only measures the linear dependence between the two variables. In this case, X and Y are not at all linearly related. Problem 39. Continuous Joint Distributions Suppose X and Y are continuous random variables with joint density function f(, y) = +y on the unit square [0, ] [0, ]. (a) Let F (, y) be the joint CDF. Compute F (, ). Compute F (, y). (b) Compute the marginal densities for X and Y.

Class 26: review for final eam, Spring 204 9 (c) Are X and Y independent? (d) Compute E(X), (Y ), E(X 2 + Y 2 ), Cov(X, Y ). Law of Large Numbers, Central Limit Theorem Problem 40. Suppose X,..., X 00 are i.i.d. with mean /5 and variance /9. Use the central limit theorem to estimate P ( X i < 30). Problem 4. All or None You have $00 and, never mind why, you must convert it to $000. Anything less is no good. Your only way to make money is to gamble for it. Your chance of winning one bet is p. Here are two etreme strategies: Maimum strategy: bet as much as you can each time. To be smart, if you have less than $500 you bet it all. If you have more, you bet enough to get to $000. Minimum strategy: bet $ each time. If p <.5 (the odds are against you) which is the better strategy? What about p >.5 or p =.5? Problem 42. (More Central Limit Theorem) The average IQ in a population is 00 with standard deviation 5 (by definition, IQ is normalized so this is the case). What is the probability that a randomly selected group of 00 people has an average IQ above 5? Problem 43. Hospitals (binomial, CLT, etc) A certain town is served by two hospitals. Larger hospital: about 45 babies born each day. Smaller hospital about 5 babies born each day. For a period of year, each hospital recorded the days on which more than 60% of the babies born were boys. (a) Which hospital do you think recorded more such days? (i) The larger hospital. (ii) The smaller hospital. (iii) About the same (that is, within 5% of each other). (b) Let L i (resp., S i ) be the Bernoulli random variable which takes the value if more than 60% of the babies born in the larger (resp., smaller) hospital on the i th day were boys. Determine the distribution of L i and of S i.

Class 26: review for final eam, Spring 204 0 (c) Let L (resp., S) be the number of days on which more than 60% of the babies born in the larger (resp., smaller) hospital were boys. What type of distribution do L and S have? Compute the epected value and variance in each case. (d) Via the CLT, approimate the.84 quantile of L (resp., S). Would you like to revise your answer to part (a)? (e) What is the correlation of L and S? What is the joint pmf of L and S? Visualize the region corresponding to the event L > S. Epress P (L > S) as a double sum. Post unit 2:. Confidence intervals 2. Bootstrap confidence intervals 3. Linear regression Problem 44. Confidence interval : basketball Suppose that against a certain opponent the number of points the MIT basketaball team scores is normally distributed with unknown mean θ and unknown variance, σ 2. Suppose that over the course of the last 0 games between the two teams MIT scored the following points: 59, 62, 59, 74, 70, 6, 62, 66, 62, 75 Compute a 95% t confidence interval for θ. Does 95% confidence mean that the probability θ is in the interval you just found is 95%? Problem 45. Confidence interval 2 The volume in a set of wine bottles is known to follow a N(µ, 25) distribution. You take a sample of the bottles and measure their volumes. How many bottles do you have to sample to have a 95% confidence interval for µ with width? Problem 46. Polling confidence intervals You do a poll to see what fraction p of the population supports candidate A over candidate B. How many people do you need to poll to know p to within % with 95% confidence? Problem 47. Polling confidence intervals 2 Let p be the fraction of the population who prefer candidate A. If you poll 400 people, how many have to prefer candidate A to make the 90% confidence interval entirely in the range where A is preferred. Problem 48. Confidence intervals 3

Class 26: review for final eam, Spring 204 Suppose you made 40 confidence intervals with confidence level 95%. About how many of them would you epect to be wrong? That is, how many would not actually contain the parameter being estimated? Should you be surprised if 0 of them are wrong? Problem 49. (Confidence intervals) A statistician chooses 20 randomly selected class days and counts the number of students present in 8.05. The find a standard deviation of 4.56 students If the number of students present is normally distributed, find the 95% confidence interval for the population standard deviation of the number of students in attendance. Problem 50. Linear regression (least squares) (a) Set up fitting the least squares line through the points (, ), (2, ), and (3, 3). Also see the eam 2 and post eam 2 practice material and the practice final.

MIT OpenCourseWare https://ocw.mit.edu 8.05 Introduction to Probability and Statistics Spring 204 For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.