Solutions to the Fall 2016 CAS Exam S

Size: px
Start display at page:

Download "Solutions to the Fall 2016 CAS Exam S"

Transcription

1 Solutions to the Fall 2016 CAS Exam S There were 45 questions in total, of equal value, on this 4 hour exam. There was a 15 minute reading period in addition to the 4 hours. The Exam S is copyright 2016 by the Casualty Actuarial Society. The exam is available from the CAS. The solutions and comments are solely the responsibility of the author. CAS Exam S prepared by Howard C. Mahler, FCAS Copyright 2016 by Howard C. Mahler. Howard Mahler hmahler@mac.com

2 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 1 1. You are given the following information: An insurance company pays claims according to a Poisson process at a rate of 5 per day. Claims are sub-divided into three categories: Minor, Major, and Severe, with claim amounts provided below: Category Claim Amount Minor 1 Major 4 Severe 10 It is known that the proportion of claims in the Severe category is The total expected claim payment amount in one day is 17. Calculate the proportion of Major claims. A. Less than 0.25 B. At least 0.25, but less than 0.40 C. At least 0.40, but less than 0.55 D. At least 0.55, but less than 0.70 E. At least B. Let the proportion of Major claims be p. 17 = (5) {(1)( p) + 4p + (10)(0.15)}. p = 0.35.

3 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 2 2. For an auto policy, claims occur according to a Poisson process with rate λ = 10 per day. Once reported, claims are independently classified as Property Damage (PD) only, Bodily Injury (BI) only, or both PD and BI. You are given that 30 percent of claims are PD only, 20 percent are BI only, and the remaining 50 percent are both PD and BI. Calculate the probability that in a given day there will be exactly 4 PD only claims, 2 BI only claims, and 6 both PD and BI claims. A. Less than 0.63 percent B. At least 0.63 percent, but less than 0.68 percent C. At least 0.68 percent, but less than 0.73 percent D. At least 0.73 percent, but less than 0.78 percent E. At least 0.78 percent 2. B. Thinning, we get three independent Poisson Processes: PD with λ = 3, BI with λ = 2, and Both with λ = 5. Prob[4 PD] = e / 4! = Prob[2 BI] = e / 2! = Prob[6 Both] = e / 6! = Thus, the desired probability is: (0.1680)(0.2707)(0.1462) = 0.665%.

4 3. The time X to wait in line is an exponentially distributed random variable with mean 5 minutes. Calculate the probability that the total waiting time will be longer than 30 minutes from the time that individual arrived in line, given that the wait has already been 20 minutes. A. Less than 0.1 B. At least 0.1, but less than 0.2 C. At least 0.2, but less than 0.3 D. At least 0.3, but less than 0.4 E. At least B. Since the process is memoryless, we want the probability of waiting at least 10 more minutes: e -10/5 = Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 3

5 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 4 4. Steve catches fish at a Poisson rate of 3 per hour. The price Steve gets at the market for each fish is randomly distributed as follows: Price Probability $10 20% $20 60% $30 20% Using the normal approximation without a continuity correction, calculate the probability that Steve will receive at least $300 for fish caught in a four hour-period. A. Less than 0.19 B. At least 0.19, but less than 0.20 C. At least 0.20, but less than 0.21 D. At least 0.21, but less than 0.22 E. At least C. Mean severity is: (20%)(10) + (60%)(20) + (20%)(30) = 20. Second moment of severity is: (20%)(10 2 ) + (60%)(20 2 ) + (20%)(30 2 ) = 440. For 4 hours, mean aggregate is: (3)(4)(20) = 240. Variance of aggregate is: (3)(4)(440) = Without using the continuity correction: Prob[at least 300] = 1 - Φ[ ] = 1 - Φ[0.83] = =

6 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 5 5. A call center currently has 2 representatives and 2 interns who can handle customer calls. If all representatives including interns are currently on a call, an incoming call will be placed on hold until a representative or intern is available. You are given the following information: For each representative, the time taken to handle each call is given by an exponential distribution with a mean value equal to 1. For each intern, the time taken to handle each call is given by an exponential distribution with mean 2. Handle times are independent A customer calls the call center and is placed on hold, and is the first person in line. Calculate the expected time to complete the call (including both hold time and service). A. Less than 1.2 B. At least 1.2, but less than 1.4 C. At least 1.4, but less than 1.6 D. At least 1.6, but less than 1.8 E. At least D. The time until he is served, is the minimum of 4 independent Exponentials with hazard rates: 1, 1, 1/2, 1/2. This is an Exponential with hazard rate: /2 + 1/2 = 3. The mean time waiting is 1/3. (2)(1) The probability that the first person available is a representative is: (2)(1) + (2)(1/ 2) = 2/3. The mean service time is: (2/3)(1) + (1-2/3)(2) = 4/3. Expected total time is: 1/3 + 4/3 = 5/3.

7 6. You are given the following information: At the time of installation a given machine has the following survival function: S(x) = 1 - Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 6 x 1/ 2, for 0 x 100; x in months 100 After 16 months, that machine is still functioning. Calculate the probability that the machine will stop working between months 36 and 51. A. Less than B. At least 0.095, but less than C. At least 0.105, but less than D. At least 0.115, but less than E. At least C. S(16) = 1-16 / 100 = S(36) = 1-36 / 100 = 0.8. S(51) = 1-51/ 100 = 0.7 Prob[fails between 36 and 51 survives to 16] = S(36) - S(51) S(16) = =

8 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 7 7. You are given the following information about a watch with 6 different parts: There are 3 red wires with expected lifetimes of 50, 75, and 100. There are 3 yellow wires with expected lifetimes of 25, 50, and 75. The lifetimes of all wires are independent and exponentially distributed. Calculate the probability that a red wire will break down before a yellow wire. A. Less than 0.20 B. At least 0.20, but less than 0.25 C. At least 0.25, but less than 0.30 D. At least 0.30, but less than 0.35 E. At least E. Minimum of the failure times for red wires is Exponential with hazard rate: 1/50 + 1/75 + 1/100 = Minimum of the failure times for yellow wires is Exponential with hazard rate: 1/25 + 1/50 + 1/75 = Prob[red before yellow] = = 37.1%.

9 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 8 8. For a parallel system with two independent machines, you are given the following information: The hazard rate for each machine is: 1 µ x =, for 0 x < 100; x in months x One machine has worked for 40 months and the other machine has worked for 60 months. Calculate the probability that the system will function for 20 more months. A. Less than 0.35 B. At least 0.35, but less than 0.55 C. At least 0.55, but less than 0.75 D. At least 0.75, but less than 0.95 E. At least D. H(x) = x 1 dt = ln[100] - ln[100 - x] t 0 S(x) = Exp[-H(x)] = (100-x)/100 = 1 - x/100, for 0 x < 100. Prob[Machine 1 fails in next 20 months] = S(40) - S(60) S(40) = = 1/3. Prob[Machine 2 fails in next 20 months] = S(60) - S(80) S(60) = = 1/2. Prob[systems functions for next 20 months] = 1 - (1/3)(1/2) = 5/6. 1 Comment: For a uniform distribution from 0 to ω, µ x = ω - x.

10 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 9 9. You are given the following information: A system has two minimal path sets: {1, 2, 4} and {1, 3, 4}. Reliability for components 1 and 2 is uniformly distributed from 0 to 1. Reliability for components 3 and 4 is uniformly distributed from 0 to 2. All components in the system are independent. You are starting at time 0. Calculate the probability that the lifetime of the system will be less than A. Less than 0.30 B. At least 0.30, but less than 0.35 C. At least 0.35, but less than 0.40 D. At least 0.40, but less than 0.45 E. At least C. The system function if either: 1, 2, and 4 function, or 1, 3, and 4 function. Thus the system fails if either: 1 fails, 4 fails, or 2 and 3 both fail. Prob[1 fails by 0.25] = Prob[4 fails by time 0.25] = Prob[2 and 3 both fail by time 0.25] = (0.25)(0.125) = The probability that system functions until 0.25 is: (1-0.25) ( ) ( ) = The probability that the lifetime of the system will be less than 0.25 is: = Alternately, Prob[system functions until time 0.25] = Prob[1, 2, and 4 function until time 0.25] + Prob[1, 3, and 4 function until time 0.25] - Prob[1, 2, 3, and 4 function until time 0.25] = (0.75)(0.75)(0.875) + (0.75)(0.875)(0.875) - (0.75)(0.75)(0.875)(0.875) = The probability that the lifetime of the system will be less than 0.25 is: = Comment: S(t) = (1- t)(1 - t/2)(1 - t t/2) = 1-1.5t t t 4, for t 1. S(1) = 0. Thus the mean lifetime of the system is: 1 (1-1.5t t t 4 ) dt = 1-1.5/ /4-0.25/5 =

11 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following information about a system: This is a 2-out-of-3 system. All components are independent. The probability of each component functioning is p = Calculate the reliability of the system. A. Less than B. At least 0.970, but less than C. At least 0.975, but less than D. At least 0.980, but less than E. At least B. System works if either exactly 2 or exactly 3 components work. Thus the probability that the system works is: (3)(0.9 2 )(0.1) =

12 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page A firm classifies its workers into one of four categories based on their employment status: 1. Full-Time 2. Part-Time 3. On-Leave 4. Retired Workers independently transition between categories at the end of each year according to a Markov chain with the following transition probability matrix: P = Calculate the probability a current full-time employee will ever go on leave. A. Less than 0.15 B. At least 0.15, but less than 0.30 C. At least 0.30, but less than 0.45 D. At least 0.45, but less than 0.60 E. At least 0.60

13 11. C. Let x be the chance that if the chain is in state 1 it ever enters state 3 in the future. Let y be the chance that if the chain is in state 2 it ever enters state 3 in the future. (Since state 4 is an absorbing state, if the chain is in state 4 it can not enter state 3 in the future.) If the chain is in state 1, the probabilities of the next state are: 0.6, 0.1, 0.1, and 0.2. Therefore: x = 0.6x + 0.1y x = y + 1 If the chain is in state 2, the probabilities of the next state are: 0.4, 0.4, 0.1, and 0.1. Therefore: y = 0.4x + 0.4y y = 4x x = (4x+1)/ x = 7/20 = y = (4)(0.35) - 1 = 0.4. Comment: s ij = the expected number of time periods the chain is in state j, if it starts in state i, for i and j both transient states. Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 12 Let P T be the matrix of transition probabilities for transient states only. Then S = (I-P T ) -1. f ij = probability that the chain visits state j at some time in the future, if it starts in state i, for both i and j transient states. f ij = (s ij - δ ij )/s jj, where δ ij = 1 if i = j and 0 if i j P T = I-P T = Using a computer, (I-P T ) -1 = Then f 13 = (s 13-0)/s 33 = / = f 23 = (s 23-0)/s 33 = / = 0.40.

14 12. You are given the following information: Stock XYZ pays a quarterly dividend based on the current state of the economy, as follows: State Quarterly Dividend per Share 1: Economic Peak : Recession : Economic Trough : Economic Expansion 0.20 The economy changes among these four states each quarter according to the transition probability matrix: P = Annual interest rate i = 0%. Transition happens after the dividend is paid. Calculate the long-term quarterly dividend per share for Stock XYZ. A. Less than 0.08 B. At least 0.08, but less than 0.11 C. At least 0.11, but less than 0.14 D. At least 0.14, but less than 0.17 E. At least B. Going down the columns of the transition matrix, the balance equations are: 0.2 π π 4 = π 1. π 4 = 2π π π 2 = π 2. π 2 = 8π 1 / π π 3 = π 3. π 3 = π 2 /2 = 4π 1 / π π 4 = π 4. Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 13 Also, π 1 + π 2 + π 3 + π 4 = 1. π 1 (1 + 8/3 + 4/3 + 2) = 1. π 1 = 1/7. π 2 = 8/21. π 3 = 4/21. π 4 = 2/7. The long-term quarterly dividend per share is: (1/7)(0.3) + (8/21)(0.01) + (4/21)(0) + (6/21)(0.2) =

15 13. You are given the following information for ABC Insurance Company, which sells auto policies: Customers transition each policy period between three states. The average service cost per customer per policy period varies by state as follows: State Average Service Cost per Customer 1: Gold 2 2: Silver 1 3: Copper 15 The transition probability matrix is: P = Calculate the long-term average service cost per customer per policy period. A. Less than 3.0 B. At least 3.0, but less than 5.0 C. At least 5.0, but less than 7.0 D. At least 7.0, but less than 9.0 E. At least B. Going down the columns of the transition matrix, the balance equations are: 0.8 π π 2 = π 1. π 1 = π π π π 3 = π π π 3 = π 3. π 3 = π 2 /2. Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 14 Also, π 1 + π 2 + π 3 = 1. π 2 ( /2) = 1. π 2 = 0.4. π 1 = 0.4. π 3 = 0.2. The long-term average service cost per customer per policy period is: (0.4)(2) + (0.4)(1) + (0.2)(15) = 4.2.

16 14. Two models are used to estimate the probability of at least one claim for an auto policy in a given year. The models are defined as follows: Model I: Independent events with probability of occurrence of at least one claim in a given year equal to 0.1. Model II: Markov Chain with conditional probability of at least one claim given no claims in prior year equal to 0.08, and conditional probability of at least one claim given at least one claim in prior year equal to There are no claims in Year 1. The probability of at least one claim in Year 2 and no claims in Year 3 and at least one claim in Year 4 are denoted by P I and P II, for models I and II respectively. Calculate I P I - P II I. Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 15 A. Less than 0.3 percent B. At least 0.3 percent, but less than 0.4 percent C. At least 0.4 percent, but less than 0.5 percent D. At least 0.5 percent, but less than 0.6 percent E. At least 0.6 percent 14. B. For Model I, the probability is: (0.1)(0.9)(0.1) = For Model II, the probability is: (0.08)(0.88)(0.08) = The absolute difference is: = %. Comment: Does not really make use of Markov Chain techniques. For Model II, the transition matrix is: No Yes

17 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following information: A new medical breakthrough is expected to reduce the probability of death between ages 70 and 75 by one-half. Prior to the breakthrough, mortality follows the Illustrative Life Table. Calculate the probability that a life (50) lives to age 75 after the breakthrough. A. Less than 0.64 B. At least 0.64, but less than 0.66 C. At least 0.66, but less than 0.68 D. At least 0.68, but less than 0.70 E. At least C. For the Illustrative Life Table: 20 p 50 = S(70)/S(50) = 6,616155/8,950,901 = For the Illustrative Life Table: 5 p 70 = S(75)/S(70) = 5,396,081/6, = However, the probability of death between ages 70 and 75 is only one-half of what it was or: ( )/2 = Thus the probability that a life (50) lives to age 75 after the breakthrough is: (0.7392)( ) = Comment: Multiplying the mortality rate by a constant, takes the survival function to the power of that constant. Thus if instead the mortality rates between ages 70 and 75 were reduced by a half, then the probability that a life (50) lives to age 75 would be: (0.7392)( ) =

18 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following information: There are three independent lives (50), (60), and (70). Mortality follows the Illustrative Life Table. Calculate the probability that (70) will be the only person alive after 20 years. A. Less than B. At least , but less than C. At least , but less than D. At least , but less than E. At least B. 20 p 50 = S(70)/S(50) = 6,616155/8,950,901 = p 60 = S(80)/S(60) = 3,914,365/8,188,074 = p 70 = S(90)/S(70) = 1,058,491/6, = The probability that (70) will be the only person alive after 20 years is: (0.1560) ( ) ( ) =

19 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given an independent random sample of size n, x 1 x 2,..., x n, from an exponential distribution with mean θ. Calculate the Rao-Cramer lower bound for θ^, an unbiased estimator of θ. A. θ B. X C. 1/θ 2 D. θ 2 /n E. 1/θ^ D. f(x) = (1/θ) e -x/θ. ln f(x) = - ln(θ) - x/θ. ln[f(x)] θ = -1/θ + (x/θ 2 ). 2 ln[f(x)] θ 2 = 1/θ 2 - (2x/θ 3 ). E[ 2 ln[f(x)] θ 2 ] = 1/θ 2-2E[X]/θ 3 = 1/θ 2-2θ/θ 3 = -1/θ 2. Rao-Cramer lower bound for θ^ is: -1 n E[ 2 ln[f(x)] θ 2 ] = θ 2 / n. Alternately, the Exponential Distribution is a linear exponential family, and thus maximum likelihood is equal to the method of moments and the variance of the estimated parameter is equal to the Rao-Cramer lower bound. θ^ = X. Var[θ^ ] = Var[X ] = Var[X]/n = θ 2 / n. Comment: The Rao-Cramer lower bound contains 1/n, eliminating all of the choices but D.

20 18. You are given an independent random sample of size 50, y 1, y 2,..., y 50, from a population with mean µ and variance σ 2. Two estimators for µ are: µ^ 1 = the sample mean µ^ 2 = the weighted average sample mean, where the first 10 observations receive twice as much weight as the other observations. Calculate the efficiency of µ^ 1 relative to µ^ 2. A. Less than B. At least 1.025, but less than C. At least 1.050, but less than D. At least 1.075, but less than E. At least Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page E. Var[µ^ 1] = σ 2 /50 = 0.02σ 2. µ^ 2 = X i + X i i=1 11 (2)(10) + 40 = X i + X i i= Var[µ^ 2] = Var[ X i ] + Var[ X i ] i= = (4)(10σ2 ) + (40σ 2 ) 602 = σ 2 80/3600 = σ 2. The efficiency of µ^ 1 relative to µ^ 2 is: Var[µ^ 2]/Var[µ^ 1] = / 0.2 = Comment: For two unbiased estimators of the same quantity, define the relative efficiency of estimator B compared to estimator A as Variance[A]/Variance[B]. Estimator 1 has a smaller variance than estimator 2, thus estimator 1 is more efficient..

21 19. You are given: I. If θ^ n is an unbiased estimator of θ and the variance of θ^ n = 1/n, it must be a consistent estimator of θ. Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 20 II. If θ^ n is a consistent estimator of θ, it must be an unbiased estimator of θ. III. If θ^ n is a biased estimator of θ, it is not a consistent estimator of θ. Determine which of the following is correct. A. I is true; II is false; III is true. B. I is false; II is false; III is true. C. I is true; II is false; III is false. D. I is true; II is true; III is false. E. I is false; II is true; III is true, 19. C. An asymptotically unbiased estimator, whose variance goes to zero as the number of data points goes to infinity, is consistent. Unbiased Asymptotically unbiased. So Statement I is true. Statement II is not true. Statement III is not true. Comment: An estimator could have any of the four possible combinations of biased/unbiased and consistent/inconsistent. Statement 2: Consistent. Unbiased. Statement 3: Not Unbiased. Not Consistent. Statement 3 is the contrapositive of Statement 2; thus Statements 2 and Statements 3 are either both true or both false.

22 20. You are given an independent random sample from a normal distribution X with unknown mean, µ. You are given the following information: σ 2 (X) = 400 H 0 : µ = 0 H 1 : µ = 10 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 21 You will reject the null hypothesis if X > α for some value of α. Calculate the minimum sample size needed so that the probability of a Type I error and the probability of a Type II error are each no more than 10%. A. Less than 10 B. At least 10, but less than 15 C. At least 15, but less than 20 D. At least 20, but less than 25 E. At least E. Variance of the average is 400/n. 10% = significance = Prob[X > α µ = 0] = 1 - Φ[ α 400 / n ]. α 400 / n = % = prob. of a Type II error = Prob[X < α µ = 10] = 1 - Φ[ α - 10 α - 10 ]. 400 / n 400 / n = Dividing the two equations: α/(α-10) = -1. α = 5. n = 400 (1.282/5) 2 = Since the sample size is integer, the minimum sample size needed is 27.

23 21. You are given: A coin has two sides, Heads and Tails This coin is tossed 400 times H 0 : Probability of Heads = 0.50 H 1 : Probability of Heads 0.50 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 22 H 0 is not rejected if the number of heads in the 400 tosses is between 184 and 216 inclusive, and rejected otherwise The normal approximation to the binomial distribution is used Calculate the probability of a Type I error. A. Less than 0.01 B. At least 0.01, but less than 0.03 C. At least 0.03, but less than 0.05 D. At least 0.05, but less than 0.07 E. At least E. For q = 0.5, the mean is 200 and the variance is: (0.5)(1-0.5)(400) = Prob[reject H 0 ] = Φ[ ] - Φ[ ] = Φ[1.65] - Φ[-1.65] = = Probability of a Type I error is: = Without using the continuity correction: Prob[reject H 0 ] = Φ[ ] - Φ[ ] = Φ[1.60] - Φ[-1.60] = Probability of a Type I error is: =

24 22. You are given: A coin has two sides, Heads and Tails This coin is tossed 150 times H 0 : Probability of Heads = 0.50 H 1 : Probability of Heads = 0.60 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 23 H 0 is not rejected if the number of heads in the 150 tosses is less than 79, and rejected otherwise The normal approximation to the binomial distribution is used Calculate the probability of a Type II error. A. Less than 0.01 B. At least 0.01, but less than 0.03 C. At least 0.03, but less than 0.05 D. At least 0.05, but less than 0.06 E. At least B or C. For q = 0.6, the mean is: (0.6)(150) = 90, and variance is: (0.6)(0.4)(150) = Probability of a Type II error = Prob[fail to reject H 1 ] = Φ[ ] = Φ[-1.92] = 2.74%. 36 Without using the continuity correction one could get either of two answers: Φ[ ] = Φ[-1.83] = 3.36%, or Φ[ ] = Φ[-2.00] = 2.28%

25 23. You are given: H 0 : θ = θ 0 H 1 : θ = θ 1 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 24 An independent random sample is drawn from a distribution with probability mass function f(x θ), for x = 1, 2,..., 7. The values of the likelihood function at θ 0 and θ 1 are given in the table below. x f(x θ 0 ) f(x θ 1 ) The Neyman-Pearson lemma is used to find the most powerful test for H 0 versus H 1 with significance level α = Calculate the probability of Type II error for this test. A. Less than 0.02 B. At least 0.02, but less than 0.32 C. At least 0.32, but less than 0.62 D. At least 0.62, but less than 0.92 E. At least D. The critical region is where f(x θ 1 )/f(x θ 0 ) is big. x f(x θ 0 ) f(x θ 1 ) f(x θ 1 )/f(x θ 0 ) We want the critical region to cover 4% probability when H 0 is true. Thus we reject when x is 1, 2, 3, or 4. Thus the probability of Type II error = Prob[fail to reject H 1 ] = Prob[x 5 H 1 ] = 2% + 1% + 79% = 82%. Comment: We need to assume that a single observation is being used to test H 0 vs. H 1. If we are taking a sample of size greater than 1, things quickly get complicated. The power of the test is: = 0.18 =

26 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given an independent random sample of 10 draws from an exponential distribution with mean θ. The lowest value is excluded, and Y is the expected average value of the remaining 9 draws. Calculate Y / θ. A. Less than 1.02 B. At least 1.02, but less than 1.07 C. At least 1.07, but less than 1.12 D. At least 1.12, but less than 1.17 E. At least C. The expected value of the whole sample is θ. The expected value of the minimum is θ/10. The expected value of the remaining 9 claims is Y. Therefore, θ = (1/10) (θ/10) + (9/10) Y. Y = 1.1θ. Y/θ = 1.1. Alternately, E[X (2) ] = θ(1/10 + 1/9). E[X (3) ] = θ(1/10 + 1/9 + 1/8). E[X (10) ] = θ(1/10 + 1/9 + 1/ /1). Y = {E[X (2) ] + E[X (3) ] E[X (10) ]} / 9 = θ{9/10 + 9/9 + 8/8 + 7/ }/9 = θ9.9/9 = 1.1θ. Y/θ = 1.1.

27 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given an independent random sample of size 40, x 1, x 2,..., x 40, from an exponential distribution with mean 150. Calculate the probability that the second smallest observation (the observation with one observation smaller and 38 observations larger in the dataset) is greater than 20. A. Less than 0.02 B. At least 0.02, but less than 0.04 C. At least 0.04, but less than 0.06 D. At least 0.06, but less than 0.08 E. At least B. Prob[X (2) > 20] = Prob[all 40 are greater than 20] + Prob[exactly 39 are greater than 20] = S(20) F(20) S(20) 39 = (e -20/150 ) 40 + (40)(1 - e -20/150 )(e -20/150 ) 39 = 3.24%. Alternately, g k (y k ) = n! (k -1)! (n- k)! [F(y)]k-1 [1-F(y)] n-k f(y). Thus the density of the second order statistic (k = 2) is: n (n-1) F(y) f(y) S(y) n-2. In this case, this density is: (40)(39) (1 - e -y/150 ) (e -y/150 /150) (e -y/150 ) 38 = 10.4 (e -y39/150 - e -y40/150 ). Prob[X (2) > 20] = 10.4 e - y39/ e - y40 / 150 dy = 20 (10.4) {exp[-(20)(39)/150] 150/39 - exp[-(20)(40)/150] 150/40} = (10.4) ( ) = 3.24%. Comment: Similar to CAS3, 11/06, Q.9. Let W 1 and W 2 be two independent Exponential Distributions each with mean 150. Then X (2) is: W 1 /40 + W 2 /39. I do not see how this would help you answer this question. y(e - y / θ / θ) dy = θ. y(e - y / θ ) dy = θ E[X (2) ] = 10.4 y(e - y39/ e - y40 / 150 ) dy = (10.4) {(150/39) 2 - (150/40) 2 } = = (150) (1/40 + 1/39).

28 26. You are testing the hypothesis that the median values from two populations, m 1 and m 2 are the same. H 0 : m 1 = m 2 H 1 : m 1 m 2 Sample size n 1 = 6 Sample size n 2 = 8 Wilcoxon rank sum of sample 1: W 1 = 34 Calculate the p-value of this test. A. Less than 0.01 B. At least 0.01, but less than 0.05 C. At least 0.05, but less than 0.10 D. At least 0.10, but less than 0.20 E. At least 0.20 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page D. U = n 1 n 2 + n 1 (n 1 +1)/2 - (sum of ranks of first sample) = (6)(8) + (6)(7)/2-34 = 35. U has support on the integers from 0 to n 1 n 2 = (6)(8) = 48. In order to use the table attached to the exam, we need to subtract U from 48: = 13. The two sided p-value is: 2 Prob[U 13] = (2)(9.0%) = 18.0%.

29 27. You are given the results of a matched pairs experiment by policy. Policy Sample Loss Modeled Loss S 0 5,000 T 0 10,000 U 0 15,000 V 0 20,000 W 0 25,000 X 0 30,000 Y 80,000 35,000 Z 0 50,000 H 0 : Median Sample = Median Modeled H 1 : Median Sample > Median Modeled Calculate the smallest value for α for which H 0 can be rejected using the Sign-Rank Wilcoxon test for a matched pairs experiment. A. Less than B. At least 0.005, but less than C. At least 0.010, but less than D. At least 0.025, but less than E. At least Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page E. This is a one-sided test. The median of the sample is 0, while that of the model is 22,500. Thus there is no evidence that Median Sample > Median Modeled. Thus the p-value is much greater than 5%. Comment: I assume the question should have read H 1 : Median Sample < Median Modeled. In which case, one would take the differences; I chose to take model minus sample: 5K, 10K, 15K, 20K, 25K, 30K, -45K, 50K. So the sum of the negative ranks is 7. Looking in the table attached to the exam, for n = 8: 7 > 5. The one sided p-value is greater than 5%.

30 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following random sample, x 1, x 2,..., x 7, of size 7: a from the following distributions: f(x λ) = e- λ λ x, x {0, 1, 2,...} x! f(λ) = λ2 2 e λ, λ > 0, θ > 0 Calculate the value of a such that the Bayesian point estimate for λ = 3. A. Less than 2 B. At least 2, but less than 6 C. At least 6, but less than 10 D. At least 10, but less than 14 E. At least D. This is a Gamma-Poisson. The prior Gamma has α = 3 and θ = 1. The Posterior Gamma has αʼ = a = 11 + a, and 1/θʼ = 1/ θʼ = 1/8. Thus, 3 = αʼ θʼ = (11 + a)/8. a = 13. Alternately, using Buhlmann Credibility, K = 1/θ = 1/1 = 1. Z = N/(N+K) = 7/(7 + 1) = 7/8. Observed mean = (8 + a)/7. Prior Mean = (3)(1) = 3. Thus 3 = (7/8)(8 + a)/7 + (1/8)(3). a = 13. Comment: The density f(λ) did not need θ > 0.

31 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following information: Policies are divided into two classes, Safe and Unsafe Claims per year per policy follow a Poisson distribution Classes Percent of Population Mean Claims Per Year Per Policy Safe 40% 0.4 Unsafe 60% 0.8 A randomly selected policy had 1 claim in the first year. Calculate the probability that this randomly selected policy is an Unsafe policy. A. Less than 0.66 B. At least 0.66, but less than 0.68 C. At least 0.68, but less than 0.70 D. At least 0.70, but less than 0.72 E. At least B. Chance of observation if Safe: 0.4e -0.4 = Chance of observation if Unsafe: 0.8e -0.8 = We apply of Bayes Theorem; posterior probabilities are proportional to: (a priori chance of risk type) (chance of observation given the risk type). Probability weight for Safe: (40%)(0.2681) = Probability weight for UnSafe: (60%)(0.3595) = Posterior probability of Unsafe is: / ( ) = 66.8%.

32 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page For a large portfolio of insurance contracts, ε represents the proportion of policies for which there were claims made within one year. The random variable ε has a beta prior distribution with parameters a = 10, b = 5, and θ = 1. In a given year, for 100 randomly selected policies from the portfolio, there were 22 claims. Calculate the posterior mean of ε. A. Less than 0.15 B. At least 0.15, but less than 0.25 C. At least 0.25, but less than 0.35 D. At least 0.35, but less than 0.45 E. At least C. We have to assume no more than one claim per policy. Then this is a Beta-Bernoulli. aʼ = = 32. bʼ = = 83. Posterior mean is: aʼ/(aʼ + bʼ) = 32/( ) = 27.8%. Comment: As written, one can have more than one claim per policy. The question should be rewritten in either of two ways. Either one can have at most one claim per policy, or there are 22 out of of the 100 policies which had at least one claim.

33 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page Within the context of Generalized Linear Models, suppose that y has an exponential distribution with probability density function expressed as: f(y) = 1 µ exp(- y ); for y > 0 µ Determine the variance of y in terms of µ. A. 1/µ B. µ C. µ D. µ 2 E. Cannot be determined from the given information 31. D. An Exponential Distribution with mean µ has variance µ 2. Comment: In the tables attached to the exam, the Exponential Distribution has mean θ, second moment 2θ 2, and thus variance θ 2. Does not depend on being within the context of Generalized Linear Models. For a Gamma Distribution: Var[Y] = αθ 2 = (αθ) 2 / α = E[Y] 2 / α. Here α = 1.

34 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following GLM output: Response variable Pure Premium Response distribution Gamma Link log Parameter df β^ Intercept Risk Group 2 Group Group Group Vehicle Symbol 1 Symbol Symbol Calculate the predicted pure premium for an insured in Risk Group 2 with Vehicle Symbol 2. A. Less than 135 B. At least 135, but less than 140 C. At least 140, but less than 145 D. At least 145, but less than 150 E. At least D. Exp[ ] = e 5 =

35 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following two probability density functions: (i) f(y ; θ) = θy -θ-1 ; for y > θ (ii) f(y ; θ) = θe -yθ ; for y > 0 and θ > 0 Determine which of the following statements is true. A. (i) doesn't belong to the exponential family B. (i) and (ii) both belong to the exponential family C. Only (i) is in canonical form D. (i) and (ii) both are in canonical form E. None of above are true 33. A. The first density does not integrate to one over its support: y = θ / y θ + 1 dy = -θ y θ = 1/θ θ. Thus it is not an exponential family. θ y = θ The first distribution has the parameter as part of its support, and thus is not an exponential family. ln[f(y)] = ln[θ] + ln[y] (-θ-1). However, this does have the form a(y) b(θ) + c(θ) + d(y), with a(y) = ln[y], b(θ) = -θ - 1, c(θ) = ln[θ], and d(y) = 0. Since a(y) y, this is not in canonical form. The second distribution is an Exponential with θ the hazard rate; it is an exponential family in canonical form. ln[f(y)] = -yθ + ln[θ]. This has the form a(y) b(θ) + c(θ) + d(y), with a(y) = y, b(θ) = -θ, c(θ) = ln[θ], and d(y) = 0. Since a(y) = y, this is in canonical form. Comment: The CAS intended the answer to be B, that both densities are exponential families. Definition in Hogg, McKean, and Craig states that the support can not depend on the parameter θ. However, Section 3.2 of Dobson and Barnett does not include that portion of the commonly used definition of exponential families. This exam question is taken from Exercise 3.3 in Dobson and Barnett.

36 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are given the following information for a fitted GLM: Response variable Status (1 or 0) Response distribution Bernoulli Link Logit AIC Parameter df β^ Intercept x x The deviance residual value for the first observation is The corresponding hat matrix diagonal value is Calculate the standardized deviance residual for the first observation. A. Less than 0 B. At least 0, but less than 0.40 C. At least 0.40, but less than 0.80 D. At least 0.80, but less than 1.20 E. At least C. The standardized deviance residual is the deviance residual divided by 1 - h ii : / = Comment: The (nonstandardized) deviance residual of is also in answer range C. n For the Binomial, the deviance is: 2 {y i ln[ y i ] + (m y^ i - y i ) ln[ m i - y i ] }. i=1 i m i - y^ i Here we have a Bernoulli, so m = 1. If for example the first observation is 1 and y^ 1 = , then the first residual is: > 0. The contribution of the first observation to the deviance is: 2 {(1) ln[1/0.8868] + (1-1) ln[0/( )]} = -2 ln[0.8868] = Since the ordinary residual is positive, the first deviance residual is: = , matching the value given in the question.

37 35. You are given, y 1 y 2,..., y n, independent and Poisson distributed random variables with respective means µ i for i = 1, 2,..., n. Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 36 A Poisson GLM was fitted to the data with a log-link function expressed as: E(y i ) = exp[β 0 + β 1 x i ] where x i refers to the predictor variable. Analysis of a set of data provided the following output: x i y i y^ i y i log(y i /y^ i) Calculate the observed deviance for testing the adequacy of the model. A. Less than 4.0 B. At least 4.0, but less than 6.0 C. At least 6.0, but less than 8.0 D. At least 8.0, but less than 10.0 E. At least 10.0

38 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 37 n 35. D. For the Poisson, the deviance is: 2 {y i ln[y i / λ^ i ] - (y i - λ^ i ) } i=1 n y i ln[y i / λ^ i ] = = i=1 n y i = = 51. i=1 n λ ^ i = = 51. i=1 n 2 {y i ln[y i / λ^ i ] - (y i - λ^ i ) } = (2) ( ) = i=1 Comment: See Table 5.1 in Dobson and Barnett. For the Poisson, provided there is an intercept in the model: n n y i = λ^ i. i=1 i=1 n Thus the deviance is equal to: 2 y i ln[y i / λ^ i ]. i=1 For example, the first contribution to the deviance is: (2) { (7-6)} = Since 7-6 = 1 > 0, the first deviance residual is positive: = The deviance residuals are: 0.398, 1.139, , , 1.229, 0.527, ,

39 36. You are given the following information for a fitted model: Response variable Rating Response distribution Normal Link Identity Residual Std. Error Parameter df β^ Intercept Complaints Privileges Learning Raises Critical The first record in the data has the following values: ID Rating Complaints Privileges Learning Raises Critical The corresponding hat matrix diagonal value is Calculate the DFITS for the observation above. A. Less than -3 B. At least -3, but less than -2 C. At least -2, but less than -1 D. At least -1, but less than 0 E. At least C. Observed value of the rating is 43. Using the Identity link, the fitted value of the rating is: (51)(0.692) + (30)(-0.104) + (39)(0.249) + (61)(-0.033) + (92)(0.015) = Residual is: = Standardized residual is: Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 38 ε^i s 1 - h ii = = DFFITS i = (i th standardized residual) hii 1- h ii = (-1.577) =

40 37. You are given the outputs from two GLMs fitted to the same data from a trial of a new drug. Model 1 Model 2 Response variable Number Response variable Number Response distribution Poisson Response distribution Negative binomial Link Log Link Log AIC AIC Parameter Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 39 β^ s. e. (β^ ) Parameter β^ s. e. (β^ ) Intercept Intercept Treatdrug Treatdrug Placebo Placebo Drug Drug Age Age Determine which of the following statements is false using the Wald test. A. Under Model 1, the Treatdrug coefficient has a p-value less than B. Under Model 1, the Age coefficient has a p-value less than C. Under Model 2, the Treatdrug coefficient has a p-value less than D. Under Model 2, the Age coefficient has a p-value less than E. Under both models, the intercept coefficient has a p-value of less than D. The Wald statistic for Treatdrug coefficient for Model 1 is: /0.118 = p-value = 2 Φ[ ] < The Wald statistic for Age coefficient for Model 1 is: /0.006 = p-value = 2 Φ[-6.5] < The Wald statistic for Treatdrug coefficient for Model 2 is: /0.369 = p-value = 2 Φ[-3.707] < The Wald statistic for Age coefficient for Model 2 is: /0.021 = p-value = 2 Φ[-1.86] = (2)(0.0314) = 6.28%. The Wald statistic for the intercept coefficient for Model 1 is: 4.529/0.147 = p-value = 2 (1 - Φ[30.81]) < The Wald statistic for the intercept coefficient for Model 2 is: 4.526/0.595 = 7.6. p-value = 2 (1 - Φ[7.6]) < Comment: One has to assume that in each case the null hypothesis is that the parameter is zero. One could square my Wald Statistics and instead compare to the Chi-Square Distribution with one degree of freedom, and get the same p-values. For example, (-0.039/0.021) 2 = < p-value is greater than 5%. It would have been better if false had been underlined in the question.

41 38. You are given the following probability density function for a single random variable, X: θ 1/ 2 f(xiθ) = exp[- 2πx3 θ(x -1)2 ] 2x Consider the following statements: 1. f(x) is a member of the exponential family of distributions II. The score function, U(θ), is: U(θ) = 1 2θ - (x -1)2 2x III. The Fisher Information, I(θ), is: I(θ) = 2θ 2 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 40 Determine which of the above statements are true. A. I only B. II only C. III only D. I, II and III E. The correct answer isn't given by A, B, C or D 38. E. lnf(x) = ln[θ]/2 - ln[2πx 3 ]/2 - θ(x -1)2 2x We want: lnf(x) = a(x) b(θ) + c(θ) + d(x). This density is an exponential family with: a(x) = -(x-1) 2 /(2x), b(θ) = θ, c(θ) = ln[θ]/2, and d(x) = -ln[2πx 3 ]/2. Statement I is true.. U(θ) = lnf(x) θ = 1 2θ - (x -1)2 2x. Statement II is true. (x -1)2 Alternately, U(θ) = a(x) bʼ(θ) + cʼ(θ) = {- }(1) + 1 2x 2θ. Statement II is true. 2 lnf(x) θ 2 = 1/(2θ 2 ). I(θ) = -E[ 2 lnf(x) θ 2 ] = 1/(2θ 2 ). Statement III is not true. Alternately, I(θ) = b''(θ) c'(θ) b'(θ) - c (θ) = (0)( 1 2θ ) / 1 - {-1/(2θ2 )} = 1/(2θ 2 ). Statement III is not true. Comment: An Inverse Gaussian Distribution with µ = 1, for x > 0. It is an exponential family, but not a linear exponential family since a(x) x.

42 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page You are performing a two-factor analysis of variance with interactions, and are given the following ANOVA table: Source of variation Degrees of freedom Sum of squares Levels of Factor Levels of Factor Interactions Residuals Calculate the F-statistic for testing the significance of the interaction terms. A. Less than 2.0 B. At least 2.0, but less than 4.0 C. At least 4.0, but less than 6.0 D. At least 6.0, but less than 8.0 E. At least C. The mean sum of squares for interactions is: /4 = The mean sum of squares for residuals is: /90 = F-Statistic is: /8.026 = 5.928, with 4 and 90 degrees of freedom. Comment: Similar to Table 6.12 in Dobson and Barnett. Using a computer, the p-value is 0.03%. The mean sum of squares for Factor 1 is: /4 = In order to test whether the level of Factor 1 are significantly different, the F-Statistic is: /8.026 = 47.19, with 4 and 90 degrees of freedom. The p-value is less than The mean sum of squares for Factor 2 is: /1 = In order to test whether the level of Factor 2 are significantly different, the F-Statistic is: /8.026 = 29.95, with 1 and 90 degrees of freedom. The p-value is 4 x 10-7.

43 40. You are given the following information to find the evidence of interaction between two variables, α and β. Multiple models are fitted to a dataset and their results are shown in the following table: Model Scaled deviance Degrees of Freedom µ 26 9 µ + α j + β k µ + α j + β k + (αβ) jk 5 4 Calculate the F-statistic for testing the significance of the interaction terms. A. Less than 0.5 B. At least 0.5, but less than 2.5 C. At least 2.5, but less than 3.0 D. At least 3.0, but less than 3.5 E. At least A. For a regression, the scaled deviance is another name for the Error Sum of Squares. To test whether there is an interaction of effects, compare the model with interactions to the model without interactions. Compute the F-Statistic: (ESS R - ESS UR )/ (difference in number of parameters for the two models) ESS UR / (total number of data points - number of fitted parameters unrestricted model) = = (6.07-5) / (6-4) 5 / 4 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 42 = 0.428, with 2 and 4 degrees of freedom. Comment: Using a computer, the p-value is 67.9%.

44 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page A modeler is considering revising a GLM for claim counts with age as an explanatory variable. It is currently being included in the model as a continuous variable with no interactions. Determine which of the following statements is false. A. Including age as a categorical variable with more than two levels would increase the model degrees of freedom. B. Including a polynomial term age 2 may decrease model deviance. C. A plot of age against the residuals may be used to assess if a transformation of age is necessary. D. One way of assessing multicollinearity is by performing a regression of age against all other explanatory variables. E. Including several interaction terms involving age may make the model more parsimonious. 41. E. The degrees of freedom of the model are the number of fitted betas. The continuous variable age adds one degree of freedom to the model. A discrete age would instead add levels - 1 to the degrees of freedom to the model. Thus Statement A is true. Adding an additional term to a model usually improves the fit and thus decreases the deviance. Thus Statement B is true. If there is a pattern of the residuals with age, then we have not explained all of the variation of the response with age. In that case, a transformation of age may be useful. Statement C is true. See page 35 and Figures 2.3 and 2.4 of Dobson and Barnett. The variance inflation factor, one way of assessing multicollinearity, is calculated by performing a regression of age against all other explanatory variables. VIFj = 1 / {1 - R(j) 2 }, where R(j) 2 is the coefficient of determination from this regression. If one or more of the VIFs is large, that is an indication of multicollinearity. Thus statement D is true. A parsimonious model uses the fewest number of parameters that get the job done. Adding terms to a model makes it less parsimonious. Thus Statement E is false. Comment: It would have been better if false had been underlined in the question. Parsimonious is used here to mean stingy or frugal. See page 36 of Dobson and Barnett. Occamʼs razor or the law of parsimony states that no more causes should be assumed than are needed to account for an effect; in modeling we should not use more parameters than are needed.

45 42. You are given the following ordered sample of size 6 from a time series: Calculate the sample lag 2 autocorrelation. A. Less than -0.6 B. At least -0.6, but less than -0.3 C. At least -0.3, but less than 0.0 D. At least 0.0, but less than 0.3 E. At least C. Mean is c 0 = (1-1.45) 2 + ( ) 2 + ( ) 2 + ( ) 2 + ( ) 2 + ( ) 2 c 2 = 6 = (1-1.45)( ) + ( )( ) + ( )( ) + ( )( ) 6 = r 2 = c 2 / c 0 = / = Comment: One can use the stat function on the calculator to determine σ x 2, the biased estimator of the variance, which is c 0. Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 44

46 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page An ARMA(3,0) model is fit to the following quarterly time series: Year Quarter 1 Quarter 2 Quarter 3 Quarter The estimated coefficients are: ar1 ar2 ar3 Intercept Forecast the value for Quarter 1 of A. Less than 3.00 B. At least 3.00, but less than 3.25 C. At least 3.25, but less than 3.50 D. At least 3.50, but less than 3.75 E. At least A. (See comment) This AR(3) model, where each value has the intercept subtracted: x t = (0.252)(x t ) + (0.061)(x t ) + (-0.202)(x t ). Forecasted value for Quarter 1 of 2016 is: (0.252)( ) + (0.061)( ) + (-0.202)( ) = Comment: The formula used to describe the results of an autoregressive time series model could be interpreted as a simple OLS model with lagged variables, leading to answer B. The intent of the problem was to test candidatesʼ understanding of how a mean reverting time series model is constructed using an autoregressive model works, leading to the original posted answer A. However, given the ambiguity in the problem stem, the CAS decided to accept both A and B as correct answers. Personally, I do not see any ambiguity in this exam question. Answer B was gotten as follows: (0.252)(2.93) + (0.061)(4.62) + (-0.202)(2.12) = The output of fitting in R an AR(3) model to the given time series: ar1 ar2 ar3 intercept s.e sigma^2 estimated as 1.229: log likelihood = , aic = Other than the intercept, none of the coefficients is significantly different from zero. The output of fitting in R an AR(1) model to the given time series: ar1 intercept s.e sigma^2 estimated as 1.28: log likelihood = , aic = The AR(1) model has a better (smaller) AIC than the AR(3) model.

47 44. You are given the AR(3) model below for z t, a company's revenue for year t: z t = z t z t-3 + w t where w t is white noise, with: E(w t ) = 0 Var(w t ) = σ 2 Solutions Fall 2016 CAS Exam S, HCM 11/30/16, Page 46 The revenues for the last 4 years are as follows: Year Revenue Forecast the expected revenue for A. Less than 19 B. At least 19, but less than 20 C. At least 20, but less than 21 D. At least 21, but less than 22 E. At least E. Forecasted value for 2016: 5 + (0.85)(19) + (-0.02)(15) = Then the forecasted value for 2017: 5 + (0.85)(20.85) + (-0.02)(19) = Comment: One could rewrite the given model as: z t = 0.85(z t ) (z t ) + w t, where the intercept of = 5 / ( ).

Solutions to the Spring 2015 CAS Exam ST

Solutions to the Spring 2015 CAS Exam ST Solutions to the Spring 2015 CAS Exam ST (updated to include the CAS Final Answer Key of July 15) There were 25 questions in total, of equal value, on this 2.5 hour exam. There was a 10 minute reading

More information

Solutions to the Spring 2017 CAS Exam S

Solutions to the Spring 2017 CAS Exam S Solutions to the Spring 2017 CAS Exam S (Incorporating the Final CAS Answer Key) There were 45 questions in total, of equal value, on this 4 hour exam. There was a 15 minute reading period in addition

More information

Solutions to the Fall 2017 CAS Exam S

Solutions to the Fall 2017 CAS Exam S Solutions to the Fall 2017 CAS Exam S (Incorporating the Final CAS Answer Key) There were 45 questions in total, of equal value, on this 4 hour exam. There was a 15 minute reading period in addition to

More information

Solutions to the Spring 2018 CAS Exam MAS-1

Solutions to the Spring 2018 CAS Exam MAS-1 !! Solutions to the Spring 2018 CAS Exam MAS-1 (Incorporating the Final CAS Answer Key) There were 45 questions in total, of equal value, on this 4 hour exam. There was a 15 minute reading period in addition

More information

Practice Exam #1 CAS Exam 3L

Practice Exam #1 CAS Exam 3L Practice Exam #1 CAS Exam 3L These practice exams should be used during the last month prior to your exam. This practice exam contains 25 questions, of equal value, corresponding to an exam of about 2.5

More information

Solutions to the Fall 2018 CAS Exam MAS-1

Solutions to the Fall 2018 CAS Exam MAS-1 Solutions to the Fall 2018 CAS Exam MAS-1 (Incorporating the Preliminary CAS Answer Key) There were 45 questions in total, of equal value, on this 4 hour exam. There was a 15 minute reading period in addition

More information

Solutions to the Spring 2016 CAS Exam S

Solutions to the Spring 2016 CAS Exam S Solutions to the Spring 2016 CAS Exam S There were 45 questions in total, of equal value, on this 4 hour exam. There was a 15 minute reading period in addition to the 4 hours. The Exam S is copyright 2016

More information

CAS Exam MAS-1 Practice Exam #1

CAS Exam MAS-1 Practice Exam #1 CAS Exam MAS-1 Practice Exam #1 These practice exams should be used during the month prior to your exam. This practice exam contains 45 questions, of equal value, corresponding to about a 4 hour exam.

More information

CAS Exam MAS-1. Howard Mahler. Stochastic Models

CAS Exam MAS-1. Howard Mahler. Stochastic Models CAS Exam MAS-1 Howard Mahler Stochastic Models 2019 Stochastic Models, HCM 12/31/18, Page 1 The slides are in the same order as the sections of my study guide. Section # Section Name 1 Introduction 2 Exponential

More information

Course 4 Solutions November 2001 Exams

Course 4 Solutions November 2001 Exams Course 4 Solutions November 001 Exams November, 001 Society of Actuaries Question #1 From the Yule-Walker equations: ρ φ + ρφ 1 1 1. 1 1+ ρ ρφ φ Substituting the given quantities yields: 0.53 φ + 0.53φ

More information

Errata, Mahler Study Aids for Exam S, 2017 HCM, 10/24/17 Page 1. 1, page 2, in Table of Contents: Section 33 is Specific Examples of Markov Processes.

Errata, Mahler Study Aids for Exam S, 2017 HCM, 10/24/17 Page 1. 1, page 2, in Table of Contents: Section 33 is Specific Examples of Markov Processes. Errata, Mahler Study Aids for Exam S, 2017 HCM, 10/24/17 Page 1 1, page 2, in Table of Contents: Section 33 is Specific Examples of Markov Processes. 1, page 24, solution ot the exercise: (60%)(100) +

More information

Master s Written Examination - Solution

Master s Written Examination - Solution Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2

More information

3 Continuous Random Variables

3 Continuous Random Variables Jinguo Lian Math437 Notes January 15, 016 3 Continuous Random Variables Remember that discrete random variables can take only a countable number of possible values. On the other hand, a continuous random

More information

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes: Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation

More information

SPRING 2007 EXAM C SOLUTIONS

SPRING 2007 EXAM C SOLUTIONS SPRING 007 EXAM C SOLUTIONS Question #1 The data are already shifted (have had the policy limit and the deductible of 50 applied). The two 350 payments are censored. Thus the likelihood function is L =

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

1. If X has density. cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. f(x) =

1. If X has density. cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. f(x) = 1. If X has density f(x) = { cx 3 e x ), 0 x < 0, otherwise. Find the value of c that makes f a probability density. 2. Let X have density f(x) = { xe x, 0 < x < 0, otherwise. (a) Find P (X > 2). (b) Find

More information

Regression of Time Series

Regression of Time Series Mahlerʼs Guide to Regression of Time Series CAS Exam S prepared by Howard C. Mahler, FCAS Copyright 2016 by Howard C. Mahler. Study Aid 2016F-S-9Supplement Howard Mahler hmahler@mac.com www.howardmahler.com/teaching

More information

Course 1 Solutions November 2001 Exams

Course 1 Solutions November 2001 Exams Course Solutions November Exams . A For i =,, let R = event that a red ball is drawn form urn i i B = event that a blue ball is drawn from urn i. i Then if x is the number of blue balls in urn, ( R R)

More information

RMSC 2001 Introduction to Risk Management

RMSC 2001 Introduction to Risk Management RMSC 2001 Introduction to Risk Management Tutorial 4 (2011/12) 1 February 20, 2012 Outline: 1. Failure Time 2. Loss Frequency 3. Loss Severity 4. Aggregate Claim ====================================================

More information

A few basics of credibility theory

A few basics of credibility theory A few basics of credibility theory Greg Taylor Director, Taylor Fry Consulting Actuaries Professorial Associate, University of Melbourne Adjunct Professor, University of New South Wales General credibility

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Exam C Solutions Spring 2005

Exam C Solutions Spring 2005 Exam C Solutions Spring 005 Question # The CDF is F( x) = 4 ( + x) Observation (x) F(x) compare to: Maximum difference 0. 0.58 0, 0. 0.58 0.7 0.880 0., 0.4 0.680 0.9 0.93 0.4, 0.6 0.53. 0.949 0.6, 0.8

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

GLM I An Introduction to Generalized Linear Models

GLM I An Introduction to Generalized Linear Models GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March Presented by: Tanya D. Havlicek, ACAS, MAAA ANTITRUST Notice The Casualty Actuarial Society is committed

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

ASM Study Manual for Exam P, Second Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata

ASM Study Manual for Exam P, Second Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata ASM Study Manual for Exam P, Second Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA (krzysio@krzysio.net) Errata Effective July 5, 3, only the latest edition of this manual will have its errata

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Frequency Distributions

Frequency Distributions Mahlerʼs Guide to Frequency Distributions Exam STAM prepared by Howard C. Mahler, FCAS Copyright 2018 by Howard C. Mahler. Study Aid 2019-STAM-1 Howard Mahler hmahler@mac.com www.howardmahler.com/teaching

More information

Advanced Ratemaking. Chapter 27 GLMs

Advanced Ratemaking. Chapter 27 GLMs Mahlerʼs Guide to Advanced Ratemaking CAS Exam 8 Chapter 27 GLMs prepared by Howard C. Mahler, FCAS Copyright 2016 by Howard C. Mahler. Study Aid 2016-8 Howard Mahler hmahler@mac.com www.howardmahler.com/teaching

More information

RMSC 2001 Introduction to Risk Management

RMSC 2001 Introduction to Risk Management RMSC 2001 Introduction to Risk Management Tutorial 4 (2011/12) 1 February 20, 2012 Outline: 1. Failure Time 2. Loss Frequency 3. Loss Severity 4. Aggregate Claim ====================================================

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata

ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA (krzysio@krzysio.net) Errata Effective July 5, 3, only the latest edition of this manual will have its errata

More information

Statistics & Data Sciences: First Year Prelim Exam May 2018

Statistics & Data Sciences: First Year Prelim Exam May 2018 Statistics & Data Sciences: First Year Prelim Exam May 2018 Instructions: 1. Do not turn this page until instructed to do so. 2. Start each new question on a new sheet of paper. 3. This is a closed book

More information

Linear Regression Models P8111

Linear Regression Models P8111 Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started

More information

Probability Distributions Columns (a) through (d)

Probability Distributions Columns (a) through (d) Discrete Probability Distributions Columns (a) through (d) Probability Mass Distribution Description Notes Notation or Density Function --------------------(PMF or PDF)-------------------- (a) (b) (c)

More information

A Practitioner s Guide to Generalized Linear Models

A Practitioner s Guide to Generalized Linear Models A Practitioners Guide to Generalized Linear Models Background The classical linear models and most of the minimum bias procedures are special cases of generalized linear models (GLMs). GLMs are more technically

More information

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION

More information

Generalized Linear Models 1

Generalized Linear Models 1 Generalized Linear Models 1 STA 2101/442: Fall 2012 1 See last slide for copyright information. 1 / 24 Suggested Reading: Davison s Statistical models Exponential families of distributions Sec. 5.2 Chapter

More information

Stat 5102 Final Exam May 14, 2015

Stat 5102 Final Exam May 14, 2015 Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions

More information

57:022 Principles of Design II Final Exam Solutions - Spring 1997

57:022 Principles of Design II Final Exam Solutions - Spring 1997 57:022 Principles of Design II Final Exam Solutions - Spring 1997 Part: I II III IV V VI Total Possible Pts: 52 10 12 16 13 12 115 PART ONE Indicate "+" if True and "o" if False: + a. If a component's

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Suggested solutions to written exam Jan 17, 2012

Suggested solutions to written exam Jan 17, 2012 LINKÖPINGS UNIVERSITET Institutionen för datavetenskap Statistik, ANd 73A36 THEORY OF STATISTICS, 6 CDTS Master s program in Statistics and Data Mining Fall semester Written exam Suggested solutions to

More information

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS INSTITUTE AND FACULTY OF ACTUARIES Curriculum 09 SPECIMEN SOLUTIONS Subject CSA Risk Modelling and Survival Analysis Institute and Faculty of Actuaries Sample path A continuous time, discrete state process

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability and Statistics FS 2017 Session Exam 22.08.2017 Time Limit: 180 Minutes Name: Student ID: This exam contains 13 pages (including this cover page) and 10 questions. A Formulae sheet is provided

More information

Brief Review of Probability

Brief Review of Probability Maura Department of Economics and Finance Università Tor Vergata Outline 1 Distribution Functions Quantiles and Modes of a Distribution 2 Example 3 Example 4 Distributions Outline Distribution Functions

More information

Errata for the ASM Study Manual for Exam P, Fourth Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA

Errata for the ASM Study Manual for Exam P, Fourth Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata for the ASM Study Manual for Exam P, Fourth Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA (krzysio@krzysio.net) Effective July 5, 3, only the latest edition of this manual will have its

More information

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

More information

Ph.D. Preliminary Examination Statistics June 2, 2014

Ph.D. Preliminary Examination Statistics June 2, 2014 Ph.D. Preliminary Examination Statistics June, 04 NOTES:. The exam is worth 00 points.. Partial credit may be given for partial answers if possible.. There are 5 pages in this exam paper. I have neither

More information

Continuous random variables

Continuous random variables Continuous random variables Continuous r.v. s take an uncountably infinite number of possible values. Examples: Heights of people Weights of apples Diameters of bolts Life lengths of light-bulbs We cannot

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Exam 2 Practice Questions, 18.05, Spring 2014

Exam 2 Practice Questions, 18.05, Spring 2014 Exam 2 Practice Questions, 18.05, Spring 2014 Note: This is a set of practice problems for exam 2. The actual exam will be much shorter. Within each section we ve arranged the problems roughly in order

More information

Financial Econometrics

Financial Econometrics Financial Econometrics Estimation and Inference Gerald P. Dwyer Trinity College, Dublin January 2013 Who am I? Visiting Professor and BB&T Scholar at Clemson University Federal Reserve Bank of Atlanta

More information

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014 LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers

More information

Notes for Math 324, Part 17

Notes for Math 324, Part 17 126 Notes for Math 324, Part 17 Chapter 17 Common discrete distributions 17.1 Binomial Consider an experiment consisting by a series of trials. The only possible outcomes of the trials are success and

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

Frequency Distributions

Frequency Distributions Mahlerʼs Guide to Frequency Distributions Joint Exam 4/C prepared by Howard C. Mahler, FCAS Copyright 2012 by Howard C. Mahler. Study Aid 2012-4-1 Howard Mahler hmahler@mac.com www.howardmahler.com/teaching

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Page Max. Possible Points Total 100

Page Max. Possible Points Total 100 Math 3215 Exam 2 Summer 2014 Instructor: Sal Barone Name: GT username: 1. No books or notes are allowed. 2. You may use ONLY NON-GRAPHING and NON-PROGRAMABLE scientific calculators. All other electronic

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

Statistics 100A Homework 5 Solutions

Statistics 100A Homework 5 Solutions Chapter 5 Statistics 1A Homework 5 Solutions Ryan Rosario 1. Let X be a random variable with probability density function a What is the value of c? fx { c1 x 1 < x < 1 otherwise We know that for fx to

More information

Chapter 5: Generalized Linear Models

Chapter 5: Generalized Linear Models w w w. I C A 0 1 4. o r g Chapter 5: Generalized Linear Models b Curtis Gar Dean, FCAS, MAAA, CFA Ball State Universit: Center for Actuarial Science and Risk Management M Interest in Predictive Modeling

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS QUESTIONS 5.1. (a) In a log-log model the dependent and all explanatory variables are in the logarithmic form. (b) In the log-lin model the dependent variable

More information

Chapter 3 Common Families of Distributions

Chapter 3 Common Families of Distributions Lecture 9 on BST 631: Statistical Theory I Kui Zhang, 9/3/8 and 9/5/8 Review for the previous lecture Definition: Several commonly used discrete distributions, including discrete uniform, hypergeometric,

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Week 1 Quantitative Analysis of Financial Markets Distributions A

Week 1 Quantitative Analysis of Financial Markets Distributions A Week 1 Quantitative Analysis of Financial Markets Distributions A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

Mahler s Guide to Stochastic Models

Mahler s Guide to Stochastic Models HCMSA-S07-3-1A, Stochastic Models, 11/22/06, Page 1 Mahler s Guide to Stochastic Models Copyright 2007 by Howard C. Mahler. Information in bold or sections whose title is in bold are more important for

More information

Notes for Math 324, Part 20

Notes for Math 324, Part 20 7 Notes for Math 34, Part Chapter Conditional epectations, variances, etc.. Conditional probability Given two events, the conditional probability of A given B is defined by P[A B] = P[A B]. P[B] P[A B]

More information

Subject CS1 Actuarial Statistics 1 Core Principles

Subject CS1 Actuarial Statistics 1 Core Principles Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and

More information

Statistics and data analyses

Statistics and data analyses Statistics and data analyses Designing experiments Measuring time Instrumental quality Precision Standard deviation depends on Number of measurements Detection quality Systematics and methology σ tot =

More information

9 Generalized Linear Models

9 Generalized Linear Models 9 Generalized Linear Models The Generalized Linear Model (GLM) is a model which has been built to include a wide range of different models you already know, e.g. ANOVA and multiple linear regression models

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) (b) (c) (d) (e) In 2 2 tables, statistical independence is equivalent

More information

STAT 414: Introduction to Probability Theory

STAT 414: Introduction to Probability Theory STAT 414: Introduction to Probability Theory Spring 2016; Homework Assignments Latest updated on April 29, 2016 HW1 (Due on Jan. 21) Chapter 1 Problems 1, 8, 9, 10, 11, 18, 19, 26, 28, 30 Theoretical Exercises

More information

Master s Written Examination

Master s Written Examination Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #27 Estimation-I Today, I will introduce the problem of

More information

Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/

Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/ Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/28.0018 Statistical Analysis in Ecology using R Linear Models/GLM Ing. Daniel Volařík, Ph.D. 13.

More information

ECON 4230 Intermediate Econometric Theory Exam

ECON 4230 Intermediate Econometric Theory Exam ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the

More information

Applied Linear Statistical Methods

Applied Linear Statistical Methods Applied Linear Statistical Methods (short lecturenotes) Prof. Rozenn Dahyot School of Computer Science and Statistics Trinity College Dublin Ireland www.scss.tcd.ie/rozenn.dahyot Hilary Term 2016 1. Introduction

More information

If we want to analyze experimental or simulated data we might encounter the following tasks:

If we want to analyze experimental or simulated data we might encounter the following tasks: Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction

More information

Regression Model Building

Regression Model Building Regression Model Building Setting: Possibly a large set of predictor variables (including interactions). Goal: Fit a parsimonious model that explains variation in Y with a small set of predictors Automated

More information

SPRING 2005 EXAM M SOLUTIONS. = When (as given in the problem), (x) dies in the second year from issue, the curtate future lifetime x + 1 is 0, so

SPRING 2005 EXAM M SOLUTIONS. = When (as given in the problem), (x) dies in the second year from issue, the curtate future lifetime x + 1 is 0, so SPRING 005 EXAM M SOLUTIONS Question # Key: B Let K be the curtate future lifetime of (x + k) K + k L 000v 000Px :3 ak + When (as given in the problem), (x) dies in the second year from issue, the curtate

More information

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs STATISTICS 4 Summary Notes. Geometric and Exponential Distributions GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs P(X = x) = ( p) x p x =,, 3,...

More information

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn Parameter estimation and forecasting Cristiano Porciani AIfA, Uni-Bonn Questions? C. Porciani Estimation & forecasting 2 Temperature fluctuations Variance at multipole l (angle ~180o/l) C. Porciani Estimation

More information

Masters Comprehensive Examination Department of Statistics, University of Florida

Masters Comprehensive Examination Department of Statistics, University of Florida Masters Comprehensive Examination Department of Statistics, University of Florida May 6, 003, 8:00 am - :00 noon Instructions: You have four hours to answer questions in this examination You must show

More information

Inference and Regression

Inference and Regression Inference and Regression Assignment 3 Department of IOMS Professor William Greene Phone:.998.0876 Office: KMC 7-90 Home page:www.stern.nyu.edu/~wgreene Email: wgreene@stern.nyu.edu Course web page: www.stern.nyu.edu/~wgreene/econometrics/econometrics.htm.

More information

Reliability Engineering I

Reliability Engineering I Happiness is taking the reliability final exam. Reliability Engineering I ENM/MSC 565 Review for the Final Exam Vital Statistics What R&M concepts covered in the course When Monday April 29 from 4:30 6:00

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

Notes for Math 324, Part 19

Notes for Math 324, Part 19 48 Notes for Math 324, Part 9 Chapter 9 Multivariate distributions, covariance Often, we need to consider several random variables at the same time. We have a sample space S and r.v. s X, Y,..., which

More information

Generalized Linear Models: An Introduction

Generalized Linear Models: An Introduction Applied Statistics With R Generalized Linear Models: An Introduction John Fox WU Wien May/June 2006 2006 by John Fox Generalized Linear Models: An Introduction 1 A synthesis due to Nelder and Wedderburn,

More information

STAT 418: Probability and Stochastic Processes

STAT 418: Probability and Stochastic Processes STAT 418: Probability and Stochastic Processes Spring 2016; Homework Assignments Latest updated on April 29, 2016 HW1 (Due on Jan. 21) Chapter 1 Problems 1, 8, 9, 10, 11, 18, 19, 26, 28, 30 Theoretical

More information

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018 Mathematics Ph.D. Qualifying Examination Stat 52800 Probability, January 2018 NOTE: Answers all questions completely. Justify every step. Time allowed: 3 hours. 1. Let X 1,..., X n be a random sample from

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

MFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables

MFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables MFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables Thilo Klein University of Cambridge Judge Business School Session 5: Non-linearity,

More information

STAT 7030: Categorical Data Analysis

STAT 7030: Categorical Data Analysis STAT 7030: Categorical Data Analysis 5. Logistic Regression Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2012 Peng Zeng (Auburn University) STAT 7030 Lecture Notes Fall 2012

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information