INSTITUTE OF ACTUARIES OF INDIA

Similar documents
INSTITUTE OF ACTUARIES OF INDIA

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

INSTITUTE OF ACTUARIES OF INDIA

HWA CHONG INSTITUTION 2018 JC2 PRELIMINARY EXAMINATION. Monday 17 September hours

Notes for Math 324, Part 20

Institute of Actuaries of India

Institute of Actuaries of India

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

Thursday 8 June 2017 Morning Time allowed: 1 hour 30 minutes

Solutions to the Spring 2015 CAS Exam ST

Ph.D. Preliminary Examination Statistics June 2, 2014

ORF 245 Fundamentals of Engineering Statistics. Final Exam

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

SMAM 314 Practice Final Examination Winter 2003

Course 1 Solutions November 2001 Exams

Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you.

Wednesday 8 June 2016 Morning

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) HIGHER CERTIFICATE IN STATISTICS, 2004

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) HIGHER CERTIFICATE IN STATISTICS, 2000

ANOVA - analysis of variance - used to compare the means of several populations.

Errata for the ASM Study Manual for Exam P, Fourth Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA

physicsandmathstutor.com Paper Reference Advanced/Advanced Subsidiary Thursday 9 June 2005 Morning Time: 1 hour 30 minutes

Concordia University (5+5)Q 1.

ASM Study Manual for Exam P, Second Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata

Ch 13 & 14 - Regression Analysis

Mathematical Methods 2017 Sample paper

Paper Reference. Statistics S4 Advanced/Advanced Subsidiary. Friday 19 June 2009 Afternoon Time: 1 hour 30 minutes

Stat 500 Midterm 2 8 November 2007 page 0 of 4

ASM Study Manual for Exam P, First Edition By Dr. Krzysztof M. Ostaszewski, FSA, CFA, MAAA Errata

MEI STRUCTURED MATHEMATICS 4767

The simple linear regression model discussed in Chapter 13 was written as

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

MEI STRUCTURED MATHEMATICS STATISTICS 2, S2. Practice Paper S2-B

Notes for Math 324, Part 17

THE UNIVERSITY OF HONG KONG School of Economics & Finance Answer Keys to st Semester Examination

Standard Error of Technical Cost Incorporating Parameter Uncertainty

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Please, read the following instructions carefully:

HWA CHONG INSTITUTION 2016 JC2 PRELIMINARY EXAMINATION. Tuesday 20 September hours. List of Formula (MF15)

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Swarthmore Honors Exam 2012: Statistics

This paper is not to be removed from the Examination Halls

2016 Preliminary Examination II Pre-University 3

Introduction to Simple Linear Regression

Regression Estimation - Least Squares and Maximum Likelihood. Dr. Frank Wood

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018

UNIVERSITY OF SWAZILAND SUPPLEMENTARY EXAMINATION PAPER 2015 CALCULATOR AND STATISTICAL TABLES

Six Sigma Black Belt Study Guides

4306/2H. General Certificate of Secondary Education November MATHEMATICS (SPECIFICATION A) 4306/2H Higher Tier Paper 2 Calculator

Paper Reference. Paper Reference(s) 6684/01 Edexcel GCE Statistics S2 Advanced/Advanced Subsidiary

Paper Reference. Statistics S2 Advanced/Advanced Subsidiary. Monday 11 June 2007 Afternoon Time: 1 hour 30 minutes

Subject CS1 Actuarial Statistics 1 Core Principles

GB2 Regression with Insurance Claim Severities

Statistical inference

Final Exam - Solutions

Practice Exam #1 CAS Exam 3L

November 2000 Course 1. Society of Actuaries/Casualty Actuarial Society

; y OA3009,. UNIVERSITY OF CAMBRIDGE LOCAL EXAMINATIONS SYNDICATE ""'

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

Non-Life Insurance: Mathematics and Statistics

ACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS

You are permitted to use your own calculator where it has been stamped as approved by the University.

Ratemaking with a Copula-Based Multivariate Tweedie Model

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

M2S2 Statistical Modelling

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Correlation and Regression

Chapter 12: Inference about One Population

Candidate Name Centre Number Candidate Number MATHEMATICS UNIT 1: NON-CALCULATOR INTERMEDIATE TIER SPECIMEN PAPER SUMMER 2017

Midterm 2 - Solutions

8 LEVELS 5 7 PAPER. Paper 2. Year 8 mathematics test. Calculator allowed. First name. Last name. Class. Date YEAR

STA2601. Tutorial Letter 104/1/2014. Applied Statistics II. Semester 1. Department of Statistics STA2601/104/1/2014 TRIAL EXAMINATION PAPER

STA 2101/442 Assignment 2 1

Tuesday 21 June 2016 Morning Time allowed: 1 hour 30 minutes

Part Possible Score Base 5 5 MC Total 50

You are allowed two hours to answer this question paper. All questions are compulsory.

QUESTION ONE Let 7C = Total Cost MC = Marginal Cost AC = Average Cost

Mathematics (Linear) 4365/1H

SPRING 2007 EXAM C SOLUTIONS

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 12th Class 6/23/10

The external assessment requirements of this subject are listed on page 20. DRAFT

SLOW LEARNERS MATERIALS BUSINESS MATHEMATICS SIX MARKS QUESTIONS

MEI STRUCTURED MATHEMATICS STATISTICS 2, S2. Practice Paper S2-A

Mathematics (Project Maths Phase 1)

Thursday 25 May 2017 Morning

SCHOOL OF MATHEMATICS AND STATISTICS

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH)

(Jun09GEOG201) General Certificate of Education Advanced Subsidiary Examination June Geographical Skills PMT TOTAL. Time allowed 1 hour

CHAPTER 15 PROBABILITY Introduction

MATH 644: Regression Analysis Methods

Module 1 Linear Regression

Some Assorted Formulae. Some confidence intervals: σ n. x ± z α/2. x ± t n 1;α/2 n. ˆp(1 ˆp) ˆp ± z α/2 n. χ 2 n 1;1 α/2. n 1;α/2

Homework 2: Simple Linear Regression

Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 2 Fall 2016 Instructor: Martin Farnham

Edexcel GCE Statistics S1 Advanced/Advanced Subsidiary

AP Statistics Review Ch. 7

( ) A company found that its monthly profit, P, is given by 1. ( ) ( )

Transcription:

INSTITUTE OF ACTUARIES OF INDIA EXAMINATIONS 19 th November 2012 Subject CT3 Probability & Mathematical Statistics Time allowed: Three Hours (15.00 18.00) Total Marks: 100 INSTRUCTIONS TO THE CANDIDATES 1. Please read the instructions on the front page of answer booklet and instructions to examinees sent along with hall ticket carefully and follow without exception 2. Mark allocations are shown in brackets. 3. Attempt all questions, beginning your answer to each question on a separate sheet. However, answers to objective type questions could be written on the same sheet. 4. Please check if you have received complete Question Paper and no page is missing. If so, kindly get new set of Question Paper from the Invigilator. AT THE END OF THE EXAMINATION Please return your answer book and this question paper to the supervisor separately.

Q. 1) The probability model for the distribution of claims per policy for an Accidental Disability and Dismemberment Health plan are given in the table below. Independence of claim amounts and occurrences among claim types and claimants is assumed. Claim Type Probability of Claim Distribution of claim amounts given that a claim occurs Mean Standard Deviation Disability 0.35 50,000 10,000 Dismemberment 0.65 30,000 7,000 Calculate the mean and the standard deviation of the claim amounts per policy. [6] Q. 2) A measure of skewness is defined as: ψ = mean mode standard deviation. Find the value of ψ for a gamma distribution with parameters α = 2.5 and λ = 0.4. [4] Q. 3) A random sample of basic monthly salary (x) of 12 employees of a multinational organization has been selected. The summary of the data is as follows: x = 213,200 ; x 2 = 4,919,860,000 ; Sample Mean = 17,767 ; Sample Standard Deviation = 10,144. However later it was identified that two of these selected employees were temporary employees and needs to be replaced with two permanent employees. Calculate the sample mean and standard deviation of the revised sample if the salary of the two temporary employees is 11,000 and 48,000 and salary of the two randomly selected permanent employees is 27,500 and 31,500. (You should show intermediate workings) Comment on your answers. [7] Page 2 of 9

Q.4) An auto insurance company charges younger drivers a higher premium than it does older drivers because younger drivers as a group tend to have more accidents. The company analyses its experience data using 3 age groups: Group A includes those less than 25 years old, 22% of all its policyholders. Group B includes those 25-39 years old, 43% of all its policyholders, Group C includes those 40 years old and older. The probabilities of a claim due to an accident in any one year period for a policyholder belonging to Group A, B or C are 11%, 3% and 2%, respectively. [i] What percentage of the company s policyholders is expected to make a claim due to an accident during the next 12 months? Suppose Mr X has just had a car accident and goes on to register a claim against his insurance policy. If he is one of the company s policyholders, what is the probability that [3] he is under 25? [5] Q.5) [i] Let the moment generating function (MGF) of a random variable X bem X (t). Show that: (a) E X = M X (0) (b) Var X = M X 0 M X(0) 2. Let Z be the sum of two independent random variables X 1 and X 2. Show that the MGF of Z is the product of the MGFs of X 1 and X 2. [3] [iii] A company insures homes in three cities, J, K and L. The losses occurring in these cities are independent. The moment-generating functions for the loss distributions of the cities are: M J t = (1 2t) 3 ; M K t = (1 2t) 2.5 ; M L t = (1 2t) 4.5 Let Y represent the combined losses from the three cities. Compute E(Y) and Var(Y). [5] [10] Page 3 of 9

Q.6) Let X 1, X 2 X n be a random sample from Uniform distribution over (0, θ), where θ is an unknown parameter (> 0). [i] Outline why the Cramér-Rao lower bound for the variance of unbiased estimators of θ does not apply in this case. Consider an estimator of θ: θ c = c Y for some constant c where Y = max X i i Show that the probability density function of Y is given as: g Y y = n θ n. yn 1 for 0 < y < θ [1] Hence, show that: E Y k n θk = n + k for any non negative real number k [5] [iii] Show that the bias and mean square error (MSE) of the estimatorθ c are given as follows: c n Bias θ c = n + 1 1. θ n θ2 MSE θ c = c 2 2n θ2. c. n + 2 n + 1 + θ2 [4] [iv] Find the value of c (=c u ) for which θ c becomes an unbiased estimator of θ. [1] [v] Find the value of c(=c m ) for which the mean square error of θ c is minimised. [vi] Which of the two estimators θ c u orθ c m will you prefer for estimating θ? Give reasons. What happens when n is large? [15] Q.7) In a pre-match program, 280 callers (random callers) were asked which team they think will win the upcoming cricket match between Australia and England. If 40% of the population to which these callers belong thinks that team England will be the winner, calculate (approximately) the probability that at least 106 callers think that England will win. [3] Page 4 of 9

Q.8) The number of breakages in a damaged chromosome X follows a discrete distribution with an unknown parameter λ (> 0). The probability mass function of X is given by: P X = k = e λ λk. 1 e λ k!, k = 1, 2, A random sample of 33 damaged chromosomes was studied and the number of breakages in each chromosome was recorded in the form of a frequency table as follows: Number of Breakages Number of Chromosomes Number of Breakages Number of Chromosomes 1 11 8 2 2 6 9 1 3 4 10 0 4 5 11 1 5 0 12 1 6 1 13 1 7 0 14+ 0 [i] Find an equation satisfied by λ, the maximum likelihood estimator of λ. It is found using the observed data, the maximum likelihood estimate of λ isλ = 3.6. Using this value of λ, test the null hypothesis that the number of breakages in a damaged chromosome follows the given probability distribution. [5] [7] [12] Page 5 of 9

Q.9) The following data (x) are the number of germinations per square foot observed in an experiment where a particular type of plant seed was applied at four different rates. [i] Rate of Application 1 2 3 4 29 180 332 910 13 90 444 880 21 120 190 460 Mean 21 130 322 750 Variance 64 2,100 16,204 63,300 State the assumptions required for a one-way analysis of variance (ANOVA) and whether these data appear to violate any of them. Give reasons. An agricultural scientist working on this data applied the following three transformations to the data: x, log e x and 1 x. The following table contains values for the means and variances of the transformed data although he forgot to fill in a couple of them and thus is missing (the ones marked as ****): Rate Transformation mean variance mean variance mean variance 1 **** 0.79 2.992 0.1630 0.05301 0.0004721 2 11.29 3.94 4.827 **** 0.00833 0.0000077 3 17.69 13.49 5.716 0.1861 0.00351 0.0000025 4 27.09 23.96 6.575 0.1479 0.00147 0.0000004 [iii] [iv] [v] Complete the table for him by determining the missing values. The scientist insisted (in order to perform a one-way ANOVA) we must consider the log e transformation of the data only. Argue heuristically why this is the case? The scientist now decides to go ahead and performs the one-way ANOVA. (a) State the ANOVA model for the transformed data along with the null hypothesis he is testing. (b) Perform the ANOVA for the transformation chosen. (c) What conclusions can be drawn from the ANOVA? Calculate the 95% confidence intervals for the mean values of each of the rates on the chosen transformed scale. Back-transform these intervals onto the original scale. Page 6 of 9 [3] [8] [7] [22]

Q.10) For a study into the density of population around a large city, a random sample of 10 residential areas was selected. For each area the distance (x) in kilometres from the city centre and the population density in hundreds per square kilometre (y) were recorded. The following table shows the data and also the log e of each measurement: x y log e (x) log e (y) 0.4 149-0.916 5.004 1.0 141 0.000 4.949 3.1 102 1.131 4.625 4.5 46 1.504 3.829 4.7 72 1.548 4.277 6.5 40 1.872 3.689 7.3 23 1.988 3.135 8.2 15 2.104 2.708 9.7 7 2.272 1.946 11.7 5 2.460 1.609 Mean 5.710 60.000 1.396 3.577 Variance 13.425 2912.667 1.153 1.451 Three different regression models were considered for analysing this data. Model 1: E Y x = α + βx Model 2: E Y log e x = α + β log e x Model 3: E log e Y x = α + βx Here E. stands for the Expectation of the response variable. The following is the extract from the computer output on the analysis done for Model 1: Page 7 of 9

y y IAI CT3 1112 The regression equation: Parameter Estimate s.e α *** 11.120 β -13.958 *** Parameter t p-value α 12.56 0.000 β *** 0.000 *** 175 150 125 100 75 50 25 0 Scatter plot with Regression line -25 0.0 2.0 4.0 6.0 8.0 10.0 12.0 [ s.e : standard error; t : test statistic value for testing the significance of the coefficient] [i] In the above output some of the entries have gone missing (the ones marked as ***). Making any necessary assumptions, use the available information to compute these missing entries and perform a statistical test for significance of β at the 5% level. The following are similar extracts from the computer output on the analysis done for Models 2 and 3: x Regression Analysis: Model 2 The regression equation: Scatter plot with Regression line Parameter Estimate s.e α 126.990 9.147 β -47.980 5.293 Parameter t p-value α 13.88 0.000 β -9.07 0.000 175 150 125 100 75 50 25 0-25 -1.0-0.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 ln x [6] Page 8 of 9

ln y IAI CT3 1112 Regression Analysis: Model 3 The regression equation: Parameter Estimate s.e α 5.413 0.162 β -0.322 0.024 Parameter t p-value α 33.40 0.000 β -13.26 0.000 6.0 5.0 4.0 3.0 2.0 Scatter plot with Regression line 1.0 0.0 2.0 4.0 6.0 8.0 10.0 12.0 x [iii] [iv] [v] On the basis of the regression results given above, which of the three regression models would you consider to be the best? Justify your answer by reference to the diagnostic criteria given in the output and relating those to the corresponding plots. For the model you consider to be the best in part ii, write down an expression for y in terms of x. Using the chosen model, estimate the density of the population at a distance of 5 km from the city centre. State any reservations you have about using the model to predict population density. [5] [1] [16] *************************** Page 9 of 9