This midterm covers Chapters 6 and 7 in WMS (and the notes). The following problems are stratified by chapter.

Similar documents
0, otherwise. U = Y 1 Y 2 Hint: Use either the method of distribution functions or a bivariate transformation. (b) Find E(U).

(y 1, y 2 ) = 12 y3 1e y 1 y 2 /2, y 1 > 0, y 2 > 0 0, otherwise.

Probability and Statistics Notes

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

p y (1 p) 1 y, y = 0, 1 p Y (y p) = 0, otherwise.

STA 584 Supplementary Examples (not to be graded) Fall, 2003

[Chapter 6. Functions of Random Variables]

0, otherwise, (a) Find the value of c that makes this a valid pdf. (b) Find P (Y < 5) and P (Y 5). (c) Find the mean death time.

2. This problem is very easy if you remember the exponential cdf; i.e., 0, y 0 F Y (y) = = ln(0.5) = ln = ln 2 = φ 0.5 = β ln 2.

You may use a calculator. Translation: Show all of your work; use a calculator only to do final calculations and/or to check your work.

15 Discrete Distributions

Contents 1. Contents

MATH 360. Probablity Final Examination December 21, 2011 (2:00 pm - 5:00 pm)

Continuous random variables

This exam contains 6 questions. The questions are of equal weight. Print your name at the top of this page in the upper right hand corner.

Business Statistics Midterm Exam Fall 2015 Russell. Please sign here to acknowledge

Problem 1 (20) Log-normal. f(x) Cauchy

STAT 414: Introduction to Probability Theory

STAT 418: Probability and Stochastic Processes

2. The CDF Technique. 1. Introduction. f X ( ).

Class 26: review for final exam 18.05, Spring 2014

THE QUEEN S UNIVERSITY OF BELFAST

Probability Distributions Columns (a) through (d)

Math 151. Rumbos Fall Solutions to Review Problems for Final Exam

Limiting Distributions

STA 4322 Exam I Name: Introduction to Statistics Theory

ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata

Part 3: Parametric Models

ASM Study Manual for Exam P, Second Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata

Limiting Distributions

STAT515, Review Worksheet for Midterm 2 Spring 2019

Errata for the ASM Study Manual for Exam P, Fourth Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA

Stat 366 A1 (Fall 2006) Midterm Solutions (October 23) page 1

Lecture 1: August 28

MAS3301 Bayesian Statistics Problems 5 and Solutions

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

IE 303 Discrete-Event Simulation

First Year Examination Department of Statistics, University of Florida

1. If X has density. cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. f(x) =

MAS3301 Bayesian Statistics

2. A music library has 200 songs. How many 5 song playlists can be constructed in which the order of the songs matters?

Chapter 5. Chapter 5 sections

Math/Stats 425, Sec. 1, Fall 04: Introduction to Probability. Final Exam: Solutions

1, 0 r 1 f R (r) = 0, otherwise. 1 E(R 2 ) = r 2 f R (r)dr = r 2 3

f X (x) = λe λx, , x 0, k 0, λ > 0 Γ (k) f X (u)f X (z u)du

Math Fall 2010 Some Old Math 302 Exams There is always a danger when distributing old exams for a class that students will rely on them

Was there under-representation of African-Americans in the jury pool in the case of Berghuis v. Smith?

MATH Notebook 5 Fall 2018/2019

Part 3: Parametric Models

STA 256: Statistics and Probability I

Question Points Score Total: 76

Creating New Distributions

STAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed.

Chapter 4: CONTINUOUS RANDOM VARIABLES AND PROBABILITY DISTRIBUTIONS

3 Continuous Random Variables

HW1 (due 10/6/05): (from textbook) 1.2.3, 1.2.9, , , (extra credit) A fashionable country club has 100 members, 30 of whom are

Math438 Actuarial Probability

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

Probability and Stochastic Processes

Qualifying Exam in Probability and Statistics.

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Continuous Random Variables. and Probability Distributions. Continuous Random Variables and Probability Distributions ( ) ( ) Chapter 4 4.

STA 260: Statistics and Probability II

MATH 3200 PROBABILITY AND STATISTICS M3200SP081.1

Discrete Distributions Chapter 6

First Year Examination Department of Statistics, University of Florida

CONTINUOUS RANDOM VARIABLES

Mathematics 375 Probability and Statistics I Final Examination Solutions December 14, 2009

HW 3. (Posted on March 03, 2017) ax, 0 < x < 1 0, otherwise. f(x)dx = 1).

Introduction and Overview STAT 421, SP Course Instructor

Y i. is the sample mean basal area and µ is the population mean. The difference between them is Y µ. We know the sampling distribution

MATH/STAT 3360, Probability

Sampling Distributions

1 Exercises for lecture 1

Review Quiz. 1. Prove that in a one-dimensional canonical exponential family, the complete and sufficient statistic achieves the

Introduction to Maximum Likelihood Estimation

STA 2101/442 Assignment 3 1

Statistics for scientists and engineers

IE 230 Probability & Statistics in Engineering I. Closed book and notes. 60 minutes.

Point Estimation and Confidence Interval

IEOR 6711: Stochastic Models I SOLUTIONS to the First Midterm Exam, October 7, 2008

Closed book and notes. 120 minutes. Cover page, five pages of exam. No calculators.

Name of the Student: Problems on Discrete & Continuous R.Vs

Discrete Distributions

PROBABILITY DENSITY FUNCTIONS

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

lim F n(x) = F(x) will not use either of these. In particular, I m keeping reserved for implies. ) Note:

Continuous Random Variables. and Probability Distributions. Continuous Random Variables and Probability Distributions ( ) ( )

14.30 Introduction to Statistical Methods in Economics Spring 2009

STAT Chapter 5 Continuous Distributions

STAT 515 MIDTERM 2 EXAM November 14, 2018

MATH 407 FINAL EXAM May 6, 2011 Prof. Alexander

Continuous Distributions

Math 50: Final. 1. [13 points] It was found that 35 out of 300 famous people have the star sign Sagittarius.

Chapter Learning Objectives. Probability Distributions and Probability Density Functions. Continuous Random Variables

Sampling Distributions

Statistical Methods in Particle Physics

Continuous RVs. 1. Suppose a random variable X has the following probability density function: π, zero otherwise. f ( x ) = sin x, 0 < x < 2

7 Random samples and sampling distributions

Transcription:

This midterm covers Chapters 6 and 7 in WMS (and the notes). The following problems are stratified by chapter. Chapter 6 Problems 1. Suppose that Y U(0, 2) so that the probability density function (pdf) of Y is 1/2, 0 < y < 2 (a) Find the pdf of U = g(y ) = Y 4 + 1 and compute E(U). (b) Suppose that Y 1, Y 2,..., Y n is an iid sample from f Y (y). Derive the pdf of Y (n), the maximum order statistic, and find E(Y (n) ). 2. Actuarial data are used to posit a probability model for Y, the amount of an insurance claim (in $1000s) for a certain class of customers. Suppose that Y follows an exponential distribution with mean 5. What is the probability that the minimum of 4 independent claims Y 1, Y 2, Y 3, Y 4 exceeds $2,000? 3. Suppose that Y 1 has a gamma distribution with parameters α = 2 and β = 1. Suppose that Y 2 has an exponential distribution with mean 1. Suppose that Y 1 and Y 2 are independent. Derive the probability density function of U = Y 1 Y 2. Hint: Use either the distribution function technique or a bivariate transformation technique. Don t use mgfs. 4. In a medium-sized city, there are two hospitals. For a certain cohort of patients, assume that the length of stay in Hospital 1 is normally distributed with mean 4.6 days and standard deviation 0.9 days. Assume that the length of stay in Hospital 2 is normally distributed with mean 4.9 days and standard deviation 1.2 days. Assume additionally that the length of stay in Hospital 1 is independent of the length of stay in Hospital 2. (a) Find the probability that a patient s stay at Hospital 1 would be longer that his/her stay at Hospital 2. (b) Suppose that independent, iid samples of n 1 = n 2 = 4 patients will be taken from each hospital (that is, within each hospital, the 4 patient lengths of stay are iid, and the two samples from different hospitals are independent). Find the probability that the average stay at Hospital 1 would be longer than the average stay at Hospital 2. 5. You should know the following result: Y N (0, 1) = U = Y 2 χ 2 (1). PAGE 1

We have proven this result using the (cumulative) distribution function technique and the method of transformations. (a) Prove this result using the method of moment generating functions. (b) If Y 1, Y 2,..., Y n is an iid N (0, 1) sample, what is the distribution of T = n i=1 Explain (or prove) why your answer is correct. Y 2 i? 6. Suppose that Y 1, Y 2,..., Y n is an iid sample from a N (µ, σ 2 ) distribution. (a) Under the normal model assumption (stated above), find the moment generating function of Y, the sample mean. What is the distribution of Y? (b) In class, we looked at the (empirical) distribution of CD4 counts, observed from a sample of n = 953 Senegalese sex workers. Here is that distribution. Histogram of CD4.MEAN Frequency 0 100 200 300 400 0 500 1000 1500 2000 2500 3000 3500 CD4.MEAN Figure 1: Senegalese sex worker data. CD4 counts for n = 953 subjects. Do you think the normal model is a good model for these data? Why or why not? What about the iid assumption? (c) If, instead of a normal model, we assume that the CD4 counts Y 1, Y 2,..., Y 953 are iid gamma(a, b), find the moment generating function of Y. What is the distribution of Y under this gamma assumption? 7. Prove the following three results. Your proofs should be rigorous (or at least very convincing). PAGE 2

(a) If Y 1, Y 2,..., Y 8 are iid Poisson(θ), then U = Y 1 + Y 2 + + Y 8 Poisson(8θ). (b) If Y 1, Y 2,..., Y 8 are independent N (µ i, σi 2 ), then U = 8 ( ) Yi µ 2 i χ 2 (8). i=1 σ i (c) If Y 1, Y 2,..., Y 8 are iid exponential(β) random variables, then U = a(y 1 + Y 2 + + Y 8 ) gamma(8, aβ), for a > 0. 8. Actuarial data are used to posit a probability model for Y, the amount of an insurance claim (measured in $10,000s) for a certain class of customers. Suppose that Y follows a Weibull distribution with cumulative distribution function (cdf) F Y (y) = 0, y 0 1 e y2, y > 0. Suppose that n = 5 claims Y 1, Y 2,..., Y 5 are made. Treat Y 1, Y 2,..., Y 5 as an iid sample. (a) Find the probability density function (pdf) of the minimum order statistic Y (1). (b) What is the probability that the minimum of the claims exceeds $5,000? That is, what is P (Y (1) > 0.5)? 9. According to Newton s Law of Gravitation, if two bodies are a distance R apart, then the gravitational force U exerted by one body on the other is given by U = g(r) = k R 2, where k is a positive constant. Suppose that R is a random variable with probability density function (pdf) 3r f R (r) = 2, 0 < r < 1 (a) Find the pdf of U. Make sure you note the support. (b) Find E(U 1 ). 10. Suppose that Y 1, Y 2 is an iid sample of size n = 2 from a gamma(2, 1) distribution. Recall that the gamma(2, 1) probability density function (pdf) is given by ye y, y > 0 PAGE 3

From the method of moment generating functions, we know that U = Y 1 + Y 2 gamma(4, 1). Prove this result using either the cumulative distribution function (cdf) technique or the bivariate transformation technique. Hint: I think the bivariate transformation approach is easier, but both do work. 11. An insurance company sells an automotive insurance policy that covers all losses incurred by a policy holder. Let Y 1, Y 2,..., Y 8 denote the losses (measured in $1000) reported by a sample of n = 8 customers, and suppose that we model Y 1, Y 2,..., Y 8 as an iid sample from an exponential distribution with mean β = 2. Let T = Y 1 + Y 2 + + Y 8 denote the sum of the losses. Providing sufficient explanation, derive the moment generating function of T (don t just state the answer). What is the distribution of T? 12. Suppose that Y 1, Y 2,..., Y 5 is an iid sample of size n = 5 from f Y (y), where 3y 2, 0 < y < 1 Find the probability density function of the minimum order statistic Y (1) and compute P (Y (1) > 0.9). 13. Heparin is an anti-clotting medication, and its use is common among premature babies receiving care in neonatal intensive care units (NICU). The idea behind its use is to prevent blood clotting among babies who are receiving important fluids through a catheter (tube injected into the baby). Clotting at the site of catheter injection can be hazardous to the baby. While heparin can be successful at preventing blood clotting, its use has also been associated with increased infection (which is bad). To assess whether a heparin/antibiotic combination is helpful in preventing ancillary infections, a researcher plans to perform a clinical trial to compare premature babies randomised to the following treatment groups: Group 1 : Group 2 : Heparin Heparin + Antibiotic. An important measurement to be studied is the length of time (measured in days) a baby is hospitalized in the NICU. Denote by Y 1 and Y 2 the length of stay for babies assigned to Groups 1 and 2, respectively. For this problem, we will assume that Y 1 N (30, 64) PAGE 4

Y 2 N (25, 36) Y 1 and Y 2 are independent. Compute the probability that a baby assigned to Group 2 has a longer NICU stay than a baby assigned to Group 1. That is, compute P (Y 2 > Y 1 ). 14. Suppose that Y has an exponential distribution with mean β = 1; that is, the probability density function (pdf) of Y is given by Derive the pdf of U = g(y ) = ln Y. e y, y > 0 15. Suppose that Y 1 and Y 2 are numbers generated independently and at random from the interval (0, 1) so that the joint probability density function (pdf) of Y 1 and Y 2 is given by 1, 0 < y1 < 1, 0 < y f Y1,Y 2 (y 1, y 2 ) = 2 < 1 Define U = Y 1 Y 2, the product of Y 1 and Y 2. Derive the pdf of U. Hint: Use either the distribution function technique or a bivariate transformation technique. Don t use mgfs. 16. A triathlon is an athletic event made up of three contests (usually swimming, cycling, and running). Participants are timed for each contest, and the sum of the contest times represents the total time for the participant. For a certain triathlon, where time is measured in minutes, let Y 1 = time for event 1 Y 2 = time for event 2 Y 3 = time for event 3. In addition, for this triathlon, there is a sufficient amount of time between each event so that Y 1, Y 2, and Y 3 are independent. (a) Suppose that Y 1 N (120, 100), Y 2 N (70, 50), and Y 3 N (30, 20). What is the distribution of U = Y 1 + Y 2 + Y 3? Hint: U has a normal distribution, so first I want you to explain or prove why. Then, find the mean and variance of U. (b) What is the probability that a participant in this triathlon will take longer than 250 minutes to complete all three events; that is, what is P (U > 250)? PAGE 5

17. A machine produces spherical containers whose radii, Y, vary according to the probability density function (pdf) given by 3y 2, 0 < y < 1 (a) Find the pdf of U, the surface area of the containers. You will recall that the surface area of a sphere is given by U = g(y ) = 4πY 2. (b) Find E(U). 18. Suppose that Y 1, Y 2,..., Y n are independent random variables, where Y i follows a Poisson distribution with mean λ i ; i = 1, 2,..., n. That is, Y 1 Poisson(λ 1 ) Y 2 Poisson(λ 2 ). Y n Poisson(λ n ). Prove that U = Y 1 + Y 2 + + Y n has a Poisson distribution with mean λ = n i=1 λ i. 19. An engineering system consists of n = 5 components operating together. The component lifetimes, which we will denote by Y 1, Y 2,..., Y 5, are modeled as iid Weibull random variables with common pdf 2y /θ θ e y2, y > 0 (a) First, I want you to show that the cumulative distribution function (cdf) for this Weibull distribution is given by 0, y 0 F Y (y) = 1 e y2 /θ, y > 0. (b) Use the result in (a) to derive the probability density function for Y (1), the minimum component lifetime. Note that you can do this part, even if you could not do part (a). 20. Suppose that Y 1, Y 2 is an iid sample of size n = 2 from a standard normal distribution. Recall that the standard normal pdf is given by 1 2π e y2 /2, < y < Now, define the random variables U 1 = g 1 (Y 1, Y 2 ) and U 2 = g 2 (Y 1, Y 2 ), where g 1 (Y 1, Y 2 ) = Y 1 + Y 2 g 2 (Y 1, Y 2 ) = Y 1 Y 2. PAGE 6

(a) Use a bivariate transformation to derive the joint distribution of U 1 and U 2. (b) What is the (marginal) distribution of U 2? Chapter 7 Problems 21. The lifetime of a certain brand of industrial light bulb, Y, is assumed to follow a gamma distribution with shape parameter α = 3 and scale parameter β = 6. A random sample (i.e., an iid sample) of n = 25 light bulbs will be taken; denote these measurements by Y 1, Y 2,..., Y 25. Approximate the probability that the sample mean Y will be larger than 21. 22. Suppose that Z 1, Z 2,..., Z 6 is an iid sample from a N (0, 1) distribution. Suppose that Z 7 N (0, 1) and that Z 7 is independent of Z 1, Z 2,..., Z 6. Find the distribution of (a) Z = 1 6 6i=1 Z i (b) T = 6 i=1 (Z i Z) 2 (c) U = 3Z 7 / Z1 2 + Z2 2 + Z3 2 (d) V = (Z 2 1 + Z 2 2 + Z 2 3)/(Z 2 4 + Z 2 5 + Z 2 6). 23. Let Y 1, Y 2,..., Y 953 denote CD4 count measurements for n = 953 subjects. Assume that Y 1, Y 2,..., Y 953 are iid gamma random variables with shape α = 9 and scale β = 130. (a) Approximate the probability that the sample mean Y will be between 1150 and 1200. That is, approximate P (1150 < Y < 1200). (b) Your answer in part (a) is an approximation. Explain how you could compute P (1150 < Y < 1200) exactly without using an approximation. Hint: I m not necessarily looking for a numerical answer here. I m more interested in your explanation and mathematical approach. You could even leave your final answer as a definite integral. 24. I have four statistics T 1, T 2, T 3, and T 4. I have determined that T 1 N (0, 1) T 2 N ( 3, 4) T 3 χ 2 (3) T 4 χ 2 (5) T 1, T 2, T 3, and T 4 are mutually independent. PAGE 7

(a) What is the distribution of T 1 T 2? (b) Find a function of T 1, T 2, T 3, and T 4 that has a t distribution with 8 degrees of freedom. (c) Find a function of T 1, T 2, T 3, and T 4 that has an F distribution with 4 (numerator) and 6 (denominator) degrees of freedom. 25. A nerdy professor who teaches mathematical statistics observes Y = the number of absent students each day that class meets. There are n = 25 days of class throughout the semester, so he observes Y 1, Y 2,..., Y 25. Treat these observations as an iid sample from a Poisson distribution with mean λ = 4. (a) Derive the moment generating function of U = Y 1 + Y 2 + + Y 25, the total number of absences he observes throughout the semester. What is the distribution of U? (b) Approximate the probability that U will not exceed 125. Hint: Use CLT. 26. Suppose that Y 1, Y 2,..., Y n is an iid N (µ, σ 2 ) sample. Let Y and S 2 denote the sample mean and sample variance, respectively. (a) Find the moment generating function of Y. What is the distribution of Y? (b) Find the moment generating function of S 2. What is the distribution of S 2? 27. Suppose that Y N (1, 4); i.e., Y has a normal distribution with µ = 1 and σ 2 = 4 U χ 2 (6) V F (6, 1) Y and U are independent. (a) Find a function of Y and U that has a t distribution with 6 degrees of freedom. (b) Find a function of Y and U that has the same distribution as 1/V. 28. Body mass index (BMI) is a useful measure to estimate a healthy body weight based on how tall a person is. Suppose that a random sample of n = 26 severely obese patients is observed and that we model the BMI measurements Y 1, Y 2,..., Y 26 as an iid sample from PAGE 8

a N (40, 25) distribution. Let Y and S 2 denote the sample mean and sample variance, respectively. (a) What are the sampling distributions of Y and S 2? (b) Compute P (Y > 41, S 2 < 16.5) Hint: You will use two probability tables. 29. Suppose that Y 1, Y 2,..., Y n is an iid sample of Bernoulli observations with mean p, where 0 < p < 1. Recall that the sample proportion is p = X n, where X = Y 1 + Y 2 + + Y n. Prove that E( p) = p and that V ( p) = p(1 p). n 30. Suppose that Y 1, Y 2,..., Y 5 is an iid sample of size n = 5 from a N (0, 1) distribution. Let Y and S 2 denote the sample mean and sample variance, respectively. Define the statistics T 1 = T 2 = Y 2 1 (Y 2 2 + Y 2 Y S 2 /5 3 )/2 T 3 = 4S 2 + ( 5Y ) 2 Give the precise distribution of each statistic. You need not derive anything here rigorously; you can use sampling distribution results we discussed/proved in class. However, your arguments should be convincing. A correct answer without explanation receives almost no credit. 31. Let Y denote the time (in minutes) that it takes for a customer representative to respond to a telephone inquiry. It is assumed that Y follows a uniform distribution from 0.5 to 3.5. That is, Y U(0.5, 3.5). In a 2-hour shift, the customer representative responds to n = 15 calls with response times Y 1, Y 2,..., Y 15. Treating these times as an iid sample, approximate the probability that the sample mean response time Y is less than 2.15 minutes. That is, approximate P (Y < 2.15). 32. Suppose that U χ 2 (64), that is, U has a χ 2 distribution with ν = 64 degrees of freedom. (a) Use the Central Limit Theorem (CLT) to approximate P (60 < U < 70). State clearly how you are using the CLT. (b) If V χ 2 (8), independent of U, find a function of U and V that has an F distribution. What are the degrees of freedom associated with your distribution? PAGE 9