Master s Written Examination

Size: px
Start display at page:

Download "Master s Written Examination"

Transcription

1 Master s Written Examination Option: Statistics and Probability Spring 05 Full points may be obtained for correct answers to eight questions Each numbered question (which may have several parts) is worth the same number of points All answers will be graded, but the score for the examination will be the sum of the scores of your best eight solutions Use separate answer sheets for each question DO NOT PUT YOUR NAME ON YOUR ANSWER SHEETS When you have finished, insert all your answer sheets into the envelope provided, then seal it

2 Problem Stat 40 X and Y are independent random variables with X exponential(λ) and Y exponential(µ) It is impossible to obtain direct observations of X and Y Instead, we observe the random variables Z and W, where {, Z = X; Z = min{x, Y } and W = 0, Z = Y (a) Find the joint distribution of Z and W (b) Prove that Z and W are independent Solution to Problem (a) We first compute P (Z > z, W = ) P (Z > z, W = ) = P (Z > z, Z = X) Clearly, P (W = ) = P (Z > 0, W = ) = Similarly, we have = P (X > z, X Y ) = = P (Z z, W = ) = z z x λµe λx e µy dydx λe λx e µx dx = λ λ + µ e (λ+µ)z λ Thus λ+µ λ ( ) e (λ+µ)z λ + µ P (Z z, W = 0) = µ ( ) e (λ+µ)z λ + µ (b) It is sufficient to show that P (Z z W = i) = P (Z z) Clearly, we have On the other hand, P (Z z) = e (λ+µ)z P (Z z W = i) = P (Z z, W = i) P (W = i) Utilizing the results in (a), we can directly verify that P (Z z W = i) = P (Z z) Problem Stat 40 Suppose you have a wooden stick with length You first break it into two pieces at a randomly selected point on the stick Then you again break the longer piece at random into two pieces Let X and X be the places where you break the stick the first and second time, respectively Find the joint probability density function of X and X

3 Find the probability that you can make a triangle using the three pieces Solution to Problem Let X and X be the places where you break the stick the first/second time X Unif (0,) Let, x > /, and x (x, ); x f (x x ) =, x < /, and x (0, x ), x Since X Unif (0, ), we have, x > /, and x (x, ); x f(x, x ) = f(x )f (x x ) =, x < /, and x (0, x ), x Now we have two cases x > / In this case, the three pieces have length x, x x and x In order to make a triangle, we need x > x, x + x > x x, and x > x In other words, we need x / < x < / x < / Similarly, the three pieces have length x, x x and x Solving these inequalities gives / < x < x + / Therefore the probability that a triangle can be made using the three pieces is = P(X > /, X / < X < / OR X < /, / < X < X + /) x =/ / x / x dx dx + / /+x x =0 x =/ = log(/) / / log(/) = log 4 x dx dx Problem 3 Stat 4 Let X,, X n be a random sample from X Normal (θ, σ ), θ R is unknown, and σ > 0 is known () Find the maximum likelihood estimator ˆθ mle for mean θ () Is ˆθ mle an efficient estimator for mean θ? (3) Let parameter η = θ, determine its mle ˆη mle and the asymptotic distribution of ˆη mle? Solution to Problem 3 Log likelihood function of the sample is () The likelihood equation is l (θ) = n log ( πσ ) σ l θ = σ (x i θ) (x i θ) = 0 3

4 and its solution is ˆθ = x = n x i, and l θ = n σ < 0, so ˆθ mle = x ) () E (ˆθmle = E ( X) = θ, an unbiased estimator for θ, ) V ar (ˆθmle = V ar ( V ar (X) X) = n = σ n = ni (θ) as Fisher information of the sample is ( ) log f (X, θ) ni (θ) = ne = n θ σ The mle reaches the R-C lower bound, so it is an efficient estimator for θ ) (3) Based on the functional invariant property of the mle, ˆη mle = (ˆθmle = X Based on the asymptotic normal distribution of the mles, we have ( ) n (ˆηmle η) N 0, (η (θ)), I (θ) ie n (ˆη mle η) N (0, 4θ σ ) Problem 4 Stat 4 Let X,, X n and Y,, Y n be independent random samples from two normal distributions N(µ, σ ) and N(µ, σ ), respectively, where σ is the common but unknown variance (a) Find the likelihood ratio for testing H 0 : µ = µ = 0 against all alternatives (b) Rewrite so that it is a function of a statistic Z which has a well-known distribution under null hypotheses (Hint: X i = (X i X + X) ) (c) Give the distribution of Z under null hypotheses Solution to Problem 4 The likelihood function is ( (πσ ) n exp σ ( (X i µ ) + (a) Under the null hypotheses, the MLE of σ is ) (Y j µ ) ) j= ˆσ = X i + j= Y j n 4

5 and ( L(ˆω) = (π) n X i + j= Y ) j n exp ( n) n Under Ω, we have ˆµ = X, ˆµ = Ȳ, ˆσ = (X i X) + j= (Y j Ȳ ), n and Thus, we have (b) Notice that Thus where ( L(ˆΩ) = (π) n (X i X) + j= (Y j Ȳ ) n ) exp ( n) n = L(ˆω) L(ˆΩ) = Xi + j= Y j = Z = ( X i + j= Y j (X i X) + j= (Y j Ȳ ) (X i X) + ) n (Y j Ȳ ) + n X + nȳ j= = ( + Z) n, n X + nȳ (X i X) + j= (Y j Ȳ ) (c) Under null hypotheses, n X /σ χ (), nȳ /σ χ (), (X i X) /σ χ (n ), and j= (Y j Ȳ ) /σ χ (n ) Notice that they are independent with each other Thus (n X + nȳ )/σ χ () and ( (X i X) + j= (Y j Ȳ ) )/σ χ (n ) Consequently, Z F distribution with first df= and second df=n Problem 5 Stat 4 Consider a bivariate observation X = (X, X ) with density f θ (x, x ) = exp{ θx θ x }, x, x, θ > 0 Note: Under this model, X and X are independent (but not iid) exponential random variables with means /θ and θ, respectively Use the factorization theorem to find the minimal sufficient statistic; you don t have to prove the minimal part Find the maximum likelihood estimator of θ Is it sufficient? 3 Show that the minimal sufficient statistic identified in Part is not complete 5

6 Solution to Problem 5 The likelihood for θ is just the joint density of the observations, and it is clear that the shape of the likelihood depends on all of the data, (X, X ) So, by the factorization theorem, there is no non-trivial sufficient statistic The derivative of the log-density is X + X /θ Setting this equal to zero and solving for θ gives the MLE, ie, ˆθ = (X /X ) / This is, of course, a function of the sufficient statistic, but since it s not one-to-one, it cannot itself be a sufficient statistic 3 To show that T = (X, X ) is not complete, we need to find a non-zero function f(t) such that E θ {f(t )} = 0 for all θ Since the exponential distribution is a scale parameter family, we expect that some multiplication or division might cancel out the dependence on θ Indeed, if we take then we get f(t) = f(t, t ) = t t, E θ {f(t )} = E θ (X X ) = E θ (X )E θ (X ) = θ θ = 0, θ Since f is not identically zero, we conclude T is not complete Problem 6 Stat 46 in six consecutive years: The table below shows yearly sales (in $0, 000) of a bookstore Suppose we choose significant level 005 Time Yr- Yr- Yr-3 Yr-4 Yr-5 Yr-6 Sales (a) Test if there exists a trend using total number of runs above and below sample median (b) Use the rank von Neumann (RVN) test to see if there exists a trend (c) Are your conclusions in (a) and (b) consistent? What s your final conclusion? Solution to Problem 6 (a) Arrange the yearly sales in increasing order:,,4,6,7,0 Then the sample median is (4 + 6)/ = 5 Using + or to indicate the yearly sales above and below the sample median, we get,,, +, +, + Then the total number of runs R = According to Table D, the p-value is P (R ) = 0 > 005 We conclude that there is no significant trend at 5% level 6

7 (b) The ranks of yearly sale, rank(x i ) s, are,, 3, 5, 4, 6 Thus the test statistic NM = 5 [rank(x i ) rank(x i+ )] = = 4 According to Table S, the p-value is P (NM 4) = 0047 We conclude that there is a significant trend at 5% level (c) The conclusions in (a) and (b) are not consistent The reason is that the RVN test is more powerful than the test based on total number of runs We follow the RVN test and conclude that there is a significant trend Problem 7 Stat 43 Consider a without-replacement sample of size from a population of size 5, with joint inclusion probabilities π = π 45 = 05, π 3 = π 34 = 0, π 3 = π 4 = 005, π 4 = π 35 = 0, and π 5 = π 5 = 008 (a) Calculate the inclusion probability π i for this design (b) The variance Horvite-Thompson estimator can be written as V (ˆt HT ) = N N ( π i ) t i + π i N k i π ik π i π k π ik t i t k Show that it is equivalent to V (ˆt HT ) = N N k i π i π k π ik π ik ( ti t ) k π i π k (Hint: N k i π ik = (n )π i ) (c) Suppose that t i = i, i =,, 5 Find ˆV HT (ˆt HT ) and ˆV SY G (ˆt HT ) for sample {, } Which one is more stable in general? Solution to Problem 7 (a) π = π + π 3 + π 4 + π 5 = 04, π = π + π 3 + π 4 + π 5 = 038, π 3 = π 3 + π 3 + π 34 + π 35 = 037, π 4 = π 4 + π 4 + π 34 + π 45 = 04, π 5 = π 5 + π 5 + π 35 + π 45 = 043 (b) Utilizing that fact that N k i π ik = (n )π i, we can draw the conclusion after some ( ) t simple algebra through the expansion of the term i π i t k π k 7

8 (c) For S = {, }, we have ˆV HT (ˆt HT ) = i S ( π i ) t i π i + i S k Sk i π ik π i π k π ik t i π i t k π k For S = {, }, we have ˆV HT (ˆt HT ) = ( π ) t π = 057 ˆV HT (ˆt HT ) = i S k Sk i + ( π ) t π π i π k π ik π ik ˆV SY G (ˆt HT ) = π π π π = π π π π t π t π ( ti t ) k π i π k ( t t ) π π SYG one is generally more stable among the two estimators Problem 8 Stat 45 Suppose that X = (X,, X n ) are iid samples from a twocomponent normal mixture model with density f(x) = αn(x µ, ) + ( α)n(x ν, ) The parameter of interest is θ = (α, µ, ν), where α [0, ] Derive the EM algorithm for computing the maximum likelihood estimator of θ Solution to Problem 8 Introduce missing data Z = (Z,, Z n ), where Z i is a label for the mixture component that X i was sampled from That is, { if X i N(µ, ) Z i = i =,, n 0 if X i N(ν, ), The complete data is Y = (X, Z), and the complete-data log-likelihood is log L Y (θ) = Z i {log α + (X i µ) } + ( Z i ){log( α) + (X i ν) } Now we need the conditional distribution of Z, given X This is effectively an application of Bayes formula, and we can write Z i (X i, θ (t) ) Ber(ω (t) i ), where ω (t) i = α (t) N(X i µ (t), ), i =,, n α (t) N(X i µ (t), ) + ( α (t) )N(X i ν (t), ) Then we have the following E- and M-steps 8

9 E-step We evaluate the Q-function as follows: Q(θ θ (t) ) = E[log L Y (θ) X, θ (t) ] = ω (t) i {log α + (X i µ) } + ( ω (t) i ){log( α) + (X i ν) } M-step Differentiation of Q with respect to each element in θ, setting the equations to zero, and solving, we get the following updates: α (t+) = n ω (t) i, µ (t+) = ω(t) i X i ω(t) i, ν (t+) = ( ω(t) i )X i ( ω(t) i ) Problem 9 Stat 46 matrix (a) Find the period of state 0 A Markov chain X 0, X, has the transition probability P = (b) Is this a regular Markov chain? If yes, determine the limiting distribution If no, explain why Solution to Problem 9 (a) Since there is only one communicating class, the periods of all states are the same Hence we only need to compute the period of state 3 But it is easy to see that the period of state 3 is (b) It is not a regular Markov chain because P n 33 will not be strictly positive for all large n (Note that if the chain starts from state 3, the probability it comes back to state 3 in odd number of steps is zero) Problem 0 Stat 46 A Markov chain has a state space S = {0,, } and the following transition probability matrix If the chain starts from state, find the probability that it does not visit state prior its absorption at state 0 9

10 Solution to Problem 0 The question is equivalent to assuming that state is also an absorption state and ask what is the probability that the chain is stopped at state 0, instead of state More precisely, we have a Markov chain with state space S = {0,, } and the following transition probability matrix We want to compute the probability that the chain is eventually stopped at state 0 There are several ways to compute this probability For example, it is computed as = 3 Problem Stat 48 We are interested the effect of nozzle type on the rate of fluid flow Three specific nozzle types are under study Five runs through each of the three nozzle types led to the following results: Nozzle Type Rate of Flow ȳ i = 5 j y ij j (y ij ȳ i ) A B C Construct the ANOVA table and test whether nozzle type has an effect on fluid flow Use significance level α = 005 and F critical value F(005;,) = 389 Solution to Problem The ANOVA table is Source SS df MS F Treatment Error Total where 484 = , ȳ = ( )/3 = 9673, 633 = 5 (( ) + ( ) + ( ) ) Since 34 < F (005;, ) = 389, there is no sufficient evidence to conclude that nozzle type has an effect Problem Stat 48 Y, the model is Consider one explanatory variable x and the response variable Y i = β 0 + β x i + ε i, i =,, n, where the iid errors ε i N (0, σ ) (a) Derive the least square estimates ˆβ 0 and ˆβ as follows such that the fitted line ˆβ 0 + ˆβ x is the best straight line to fit the observed data (x, y ),, (x n, y n ),ie ˆβ = (x i x) y i (x i x), ˆβ 0 = ȳ ˆβ x 0

11 given that ( x i) n (x i ) = (x i x) (b) Prove that the distribution of ˆβ is as follows: ) ˆβ N (β, σ s xx where s xx = (x i x) (c) Based on the sampling distribution of ( ) ˆσ ˆβ β / s xx when σ is unknown, construct a confidence interval for β How to detect the existence of the linear relationship by using this confidence interval? Solution to Problem (a) Least square criterion Q (β 0, β ) = n (Y i β 0 + β x i + ε i ) Take derivative with respect to β 0, β respectively Solve the equation ˆβ = Q (β 0, β ) β 0 = Q (β 0, β ) β = (b) Let c i = (x i x) (x i x), then (Y i β 0 + β x i + ε i ) = 0 (Y i β 0 + β x i + ε i ) x i = 0 (x i x)y i (x i x), ˆβ 0 = ȳ ˆβ x ˆβ = (x i x) y i (x i x) = c i y i, as y i ind N (β 0 + β x i, σ ), then ( ˆβ N c i (β 0 + β x i ), ) c i σ = N (β, σ (c) Take ˆσ = MSE = SSE/ (n ) ( ) ˆσ ˆβ β / t (n ) s xx then the confidence interval for β : ˆβ ± t α (n ) MSE s xx If the confidence interval doesn t cover zero point, then there exists significant linear relationship between the response and the covariate at the given significance level s xx )

Master s Written Examination

Master s Written Examination Master s Written Examination Option: Statistics and Probability Spring 016 Full points may be obtained for correct answers to eight questions. Each numbered question which may have several parts is worth

More information

Master s Written Examination - Solution

Master s Written Examination - Solution Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2

More information

Master s Written Examination

Master s Written Examination Master s Written Examination Option: Statistics and Probability Fall 2013 Full points may be obtained for correct answers to eight questions. Each numbered question (which may have several parts) is worth

More information

This does not cover everything on the final. Look at the posted practice problems for other topics.

This does not cover everything on the final. Look at the posted practice problems for other topics. Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Problem Selected Scores

Problem Selected Scores Statistics Ph.D. Qualifying Exam: Part II November 20, 2010 Student Name: 1. Answer 8 out of 12 problems. Mark the problems you selected in the following table. Problem 1 2 3 4 5 6 7 8 9 10 11 12 Selected

More information

ECE 275A Homework 7 Solutions

ECE 275A Homework 7 Solutions ECE 275A Homework 7 Solutions Solutions 1. For the same specification as in Homework Problem 6.11 we want to determine an estimator for θ using the Method of Moments (MOM). In general, the MOM estimator

More information

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Put your solution to each problem on a separate sheet of paper. Problem 1. (5166) Assume that two random samples {x i } and {y i } are independently

More information

Master s Examination Solutions Option Statistics and Probability Fall 2011

Master s Examination Solutions Option Statistics and Probability Fall 2011 Master s Examination Solutions Option Statistics and Probability Fall 211 1. (STAT 41) Suppose that X, Y and Z are i.i.d. Uniform(,1). Let t > be a fixed constant. (i) Compute P ( X Y t). (ii) Compute

More information

Lecture 15. Hypothesis testing in the linear model

Lecture 15. Hypothesis testing in the linear model 14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma

More information

STAT5044: Regression and Anova. Inyoung Kim

STAT5044: Regression and Anova. Inyoung Kim STAT5044: Regression and Anova Inyoung Kim 2 / 47 Outline 1 Regression 2 Simple Linear regression 3 Basic concepts in regression 4 How to estimate unknown parameters 5 Properties of Least Squares Estimators:

More information

Spring 2012 Math 541B Exam 1

Spring 2012 Math 541B Exam 1 Spring 2012 Math 541B Exam 1 1. A sample of size n is drawn without replacement from an urn containing N balls, m of which are red and N m are black; the balls are otherwise indistinguishable. Let X denote

More information

where x and ȳ are the sample means of x 1,, x n

where x and ȳ are the sample means of x 1,, x n y y Animal Studies of Side Effects Simple Linear Regression Basic Ideas In simple linear regression there is an approximately linear relation between two variables say y = pressure in the pancreas x =

More information

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n = Spring 2012 Math 541A Exam 1 1. (a) Let Z i be independent N(0, 1), i = 1, 2,, n. Are Z = 1 n n Z i and S 2 Z = 1 n 1 n (Z i Z) 2 independent? Prove your claim. (b) Let X 1, X 2,, X n be independent identically

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:. MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss

More information

Statistics & Data Sciences: First Year Prelim Exam May 2018

Statistics & Data Sciences: First Year Prelim Exam May 2018 Statistics & Data Sciences: First Year Prelim Exam May 2018 Instructions: 1. Do not turn this page until instructed to do so. 2. Start each new question on a new sheet of paper. 3. This is a closed book

More information

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu Home Work: 1 1. Describe the sample space when a coin is tossed (a) once, (b) three times, (c) n times, (d) an infinite number of times. 2. A coin is tossed until for the first time the same result appear

More information

BIOS 2083 Linear Models c Abdus S. Wahed

BIOS 2083 Linear Models c Abdus S. Wahed Chapter 5 206 Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter

More information

Problem 1 (20) Log-normal. f(x) Cauchy

Problem 1 (20) Log-normal. f(x) Cauchy ORF 245. Rigollet Date: 11/21/2008 Problem 1 (20) f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.0 0.2 0.4 0.6 0.8 4 2 0 2 4 Normal (with mean -1) 4 2 0 2 4 Negative-exponential x x f(x) f(x) 0.0 0.1 0.2 0.3 0.4 0.5

More information

General Linear Model: Statistical Inference

General Linear Model: Statistical Inference Chapter 6 General Linear Model: Statistical Inference 6.1 Introduction So far we have discussed formulation of linear models (Chapter 1), estimability of parameters in a linear model (Chapter 4), least

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Chapter 7. Hypothesis Testing

Chapter 7. Hypothesis Testing Chapter 7. Hypothesis Testing Joonpyo Kim June 24, 2017 Joonpyo Kim Ch7 June 24, 2017 1 / 63 Basic Concepts of Testing Suppose that our interest centers on a random variable X which has density function

More information

This paper is not to be removed from the Examination Halls

This paper is not to be removed from the Examination Halls ~~ST104B ZA d0 This paper is not to be removed from the Examination Halls UNIVERSITY OF LONDON ST104B ZB BSc degrees and Diplomas for Graduates in Economics, Management, Finance and the Social Sciences,

More information

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation Yujin Chung November 29th, 2016 Fall 2016 Yujin Chung Lec13: MLE Fall 2016 1/24 Previous Parametric tests Mean comparisons (normality assumption)

More information

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables To be provided to students with STAT2201 or CIVIL-2530 (Probability and Statistics) Exam Main exam date: Tuesday, 20 June 1

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Mixed models Yan Lu March, 2018, week 8 1 / 32 Restricted Maximum Likelihood (REML) REML: uses a likelihood function calculated from the transformed set

More information

Chapter 1. Linear Regression with One Predictor Variable

Chapter 1. Linear Regression with One Predictor Variable Chapter 1. Linear Regression with One Predictor Variable 1.1 Statistical Relation Between Two Variables To motivate statistical relationships, let us consider a mathematical relation between two mathematical

More information

Notes on the Multivariate Normal and Related Topics

Notes on the Multivariate Normal and Related Topics Version: July 10, 2013 Notes on the Multivariate Normal and Related Topics Let me refresh your memory about the distinctions between population and sample; parameters and statistics; population distributions

More information

Exercises and Answers to Chapter 1

Exercises and Answers to Chapter 1 Exercises and Answers to Chapter The continuous type of random variable X has the following density function: a x, if < x < a, f (x), otherwise. Answer the following questions. () Find a. () Obtain mean

More information

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic. Serik Sagitov, Chalmers and GU, February, 08 Solutions chapter Matlab commands: x = data matrix boxplot(x) anova(x) anova(x) Problem.3 Consider one-way ANOVA test statistic For I = and = n, put F = MS

More information

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018 Mathematics Ph.D. Qualifying Examination Stat 52800 Probability, January 2018 NOTE: Answers all questions completely. Justify every step. Time allowed: 3 hours. 1. Let X 1,..., X n be a random sample from

More information

Generalized Linear Models Introduction

Generalized Linear Models Introduction Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,

More information

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part : Sample Problems for the Elementary Section of Qualifying Exam in Probability and Statistics https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 2: Sample Problems for the Advanced Section

More information

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression

Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression BSTT523: Kutner et al., Chapter 1 1 Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression Introduction: Functional relation between

More information

Answer Key for STAT 200B HW No. 7

Answer Key for STAT 200B HW No. 7 Answer Key for STAT 200B HW No. 7 May 5, 2007 Problem 2.2 p. 649 Assuming binomial 2-sample model ˆπ =.75, ˆπ 2 =.6. a ˆτ = ˆπ 2 ˆπ =.5. From Ex. 2.5a on page 644: ˆπ ˆπ + ˆπ 2 ˆπ 2.75.25.6.4 = + =.087;

More information

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B Simple Linear Regression 35 Problems 1 Consider a set of data (x i, y i ), i =1, 2,,n, and the following two regression models: y i = β 0 + β 1 x i + ε, (i =1, 2,,n), Model A y i = γ 0 + γ 1 x i + γ 2

More information

Linear Methods for Prediction

Linear Methods for Prediction Chapter 5 Linear Methods for Prediction 5.1 Introduction We now revisit the classification problem and focus on linear methods. Since our prediction Ĝ(x) will always take values in the discrete set G we

More information

Brief Review on Estimation Theory

Brief Review on Estimation Theory Brief Review on Estimation Theory K. Abed-Meraim ENST PARIS, Signal and Image Processing Dept. abed@tsi.enst.fr This presentation is essentially based on the course BASTA by E. Moulines Brief review on

More information

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Practical Econometrics. for. Finance and Economics. (Econometrics 2) Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics

More information

STAT 730 Chapter 4: Estimation

STAT 730 Chapter 4: Estimation STAT 730 Chapter 4: Estimation Timothy Hanson Department of Statistics, University of South Carolina Stat 730: Multivariate Analysis 1 / 23 The likelihood We have iid data, at least initially. Each datum

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 1 Random Vectors Let a 0 and y be n 1 vectors, and let A be an n n matrix. Here, a 0 and A are non-random, whereas y is

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida First Year Examination Department of Statistics, University of Florida August 19, 010, 8:00 am - 1:00 noon Instructions: 1. You have four hours to answer questions in this examination.. You must show your

More information

F & B Approaches to a simple model

F & B Approaches to a simple model A6523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 215 http://www.astro.cornell.edu/~cordes/a6523 Lecture 11 Applications: Model comparison Challenges in large-scale surveys

More information

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari MS&E 226: Small Data Lecture 11: Maximum likelihood (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 18 The likelihood function 2 / 18 Estimating the parameter This lecture develops the methodology behind

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

simple if it completely specifies the density of x

simple if it completely specifies the density of x 3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Hypothesis Testing: The Generalized Likelihood Ratio Test

Hypothesis Testing: The Generalized Likelihood Ratio Test Hypothesis Testing: The Generalized Likelihood Ratio Test Consider testing the hypotheses H 0 : θ Θ 0 H 1 : θ Θ \ Θ 0 Definition: The Generalized Likelihood Ratio (GLR Let L(θ be a likelihood for a random

More information

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics Mathematics Qualifying Examination January 2015 STAT 52800 - Mathematical Statistics NOTE: Answer all questions completely and justify your derivations and steps. A calculator and statistical tables (normal,

More information

[y i α βx i ] 2 (2) Q = i=1

[y i α βx i ] 2 (2) Q = i=1 Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Probability Theory and Statistics. Peter Jochumzen

Probability Theory and Statistics. Peter Jochumzen Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

STAT 705 Chapter 16: One-way ANOVA

STAT 705 Chapter 16: One-way ANOVA STAT 705 Chapter 16: One-way ANOVA Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 21 What is ANOVA? Analysis of variance (ANOVA) models are regression

More information

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others. Unbiased Estimation Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others. To compare ˆθ and θ, two estimators of θ: Say ˆθ is better than θ if it

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Part IB Statistics. Theorems with proof. Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua. Lent 2015

Part IB Statistics. Theorems with proof. Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua. Lent 2015 Part IB Statistics Theorems with proof Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua Lent 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly)

More information

Hypothesis Testing. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA

Hypothesis Testing. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA Hypothesis Testing Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA An Example Mardia et al. (979, p. ) reprint data from Frets (9) giving the length and breadth (in

More information

Statistics and Econometrics I

Statistics and Econometrics I Statistics and Econometrics I Point Estimation Shiu-Sheng Chen Department of Economics National Taiwan University September 13, 2016 Shiu-Sheng Chen (NTU Econ) Statistics and Econometrics I September 13,

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2008 Prof. Gesine Reinert 1 Data x = x 1, x 2,..., x n, realisations of random variables X 1, X 2,..., X n with distribution (model)

More information

Test Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics

Test Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics Test Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics The candidates for the research course in Statistics will have to take two shortanswer type tests

More information

MAS223 Statistical Inference and Modelling Exercises

MAS223 Statistical Inference and Modelling Exercises MAS223 Statistical Inference and Modelling Exercises The exercises are grouped into sections, corresponding to chapters of the lecture notes Within each section exercises are divided into warm-up questions,

More information

Introduction to Normal Distribution

Introduction to Normal Distribution Introduction to Normal Distribution Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 17-Jan-2017 Nathaniel E. Helwig (U of Minnesota) Introduction

More information

2. Variance and Covariance: We will now derive some classic properties of variance and covariance. Assume real-valued random variables X and Y.

2. Variance and Covariance: We will now derive some classic properties of variance and covariance. Assume real-valued random variables X and Y. CS450 Final Review Problems Fall 08 Solutions or worked answers provided Problems -6 are based on the midterm review Identical problems are marked recap] Please consult previous recitations and textbook

More information

Formal Statement of Simple Linear Regression Model

Formal Statement of Simple Linear Regression Model Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor

More information

2014/2015 Smester II ST5224 Final Exam Solution

2014/2015 Smester II ST5224 Final Exam Solution 014/015 Smester II ST54 Final Exam Solution 1 Suppose that (X 1,, X n ) is a random sample from a distribution with probability density function f(x; θ) = e (x θ) I [θ, ) (x) (i) Show that the family of

More information

STAT 100C: Linear models

STAT 100C: Linear models STAT 100C: Linear models Arash A. Amini June 9, 2018 1 / 56 Table of Contents Multiple linear regression Linear model setup Estimation of β Geometric interpretation Estimation of σ 2 Hat matrix Gram matrix

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 1: Sample Problems for the Elementary Section of Qualifying Exam in Probability and Statistics https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 2: Sample Problems for the Advanced Section

More information

Statistics Masters Comprehensive Exam March 21, 2003

Statistics Masters Comprehensive Exam March 21, 2003 Statistics Masters Comprehensive Exam March 21, 2003 Student Name: 1. Answer 8 out of 12 problems. Mark the problems you selected in the following table. 1 2 3 4 5 6 7 8 9 10 11 12 2. Write your answer

More information

Quick Tour of Basic Probability Theory and Linear Algebra

Quick Tour of Basic Probability Theory and Linear Algebra Quick Tour of and Linear Algebra Quick Tour of and Linear Algebra CS224w: Social and Information Network Analysis Fall 2011 Quick Tour of and Linear Algebra Quick Tour of and Linear Algebra Outline Definitions

More information

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others. Unbiased Estimation Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others. To compare ˆθ and θ, two estimators of θ: Say ˆθ is better than θ if it

More information

STAT 461/561- Assignments, Year 2015

STAT 461/561- Assignments, Year 2015 STAT 461/561- Assignments, Year 2015 This is the second set of assignment problems. When you hand in any problem, include the problem itself and its number. pdf are welcome. If so, use large fonts and

More information

Non-parametric Inference and Resampling

Non-parametric Inference and Resampling Non-parametric Inference and Resampling Exercises by David Wozabal (Last update. Juni 010) 1 Basic Facts about Rank and Order Statistics 1.1 10 students were asked about the amount of time they spend surfing

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

Lecture 7 Introduction to Statistical Decision Theory

Lecture 7 Introduction to Statistical Decision Theory Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7

More information

STAT 540: Data Analysis and Regression

STAT 540: Data Analysis and Regression STAT 540: Data Analysis and Regression Wen Zhou http://www.stat.colostate.edu/~riczw/ Email: riczw@stat.colostate.edu Department of Statistics Colorado State University Fall 205 W. Zhou (Colorado State

More information

Markov Chain Monte Carlo (MCMC)

Markov Chain Monte Carlo (MCMC) Markov Chain Monte Carlo (MCMC Dependent Sampling Suppose we wish to sample from a density π, and we can evaluate π as a function but have no means to directly generate a sample. Rejection sampling can

More information

STAT 4385 Topic 03: Simple Linear Regression

STAT 4385 Topic 03: Simple Linear Regression STAT 4385 Topic 03: Simple Linear Regression Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2017 Outline The Set-Up Exploratory Data Analysis

More information

Masters Comprehensive Examination Department of Statistics, University of Florida

Masters Comprehensive Examination Department of Statistics, University of Florida Masters Comprehensive Examination Department of Statistics, University of Florida May 6, 003, 8:00 am - :00 noon Instructions: You have four hours to answer questions in this examination You must show

More information

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 2013 MODULE 5 : Further probability and inference Time allowed: One and a half hours Candidates should answer THREE questions.

More information

Lecture 3. Inference about multivariate normal distribution

Lecture 3. Inference about multivariate normal distribution Lecture 3. Inference about multivariate normal distribution 3.1 Point and Interval Estimation Let X 1,..., X n be i.i.d. N p (µ, Σ). We are interested in evaluation of the maximum likelihood estimates

More information

McGill University. Faculty of Science. Department of Mathematics and Statistics. Part A Examination. Statistics: Theory Paper

McGill University. Faculty of Science. Department of Mathematics and Statistics. Part A Examination. Statistics: Theory Paper McGill University Faculty of Science Department of Mathematics and Statistics Part A Examination Statistics: Theory Paper Date: 10th May 2015 Instructions Time: 1pm-5pm Answer only two questions from Section

More information

If we want to analyze experimental or simulated data we might encounter the following tasks:

If we want to analyze experimental or simulated data we might encounter the following tasks: Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction

More information

Nonresponse weighting adjustment using estimated response probability

Nonresponse weighting adjustment using estimated response probability Nonresponse weighting adjustment using estimated response probability Jae-kwang Kim Yonsei University, Seoul, Korea December 26, 2006 Introduction Nonresponse Unit nonresponse Item nonresponse Basic strategy

More information

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1. Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error

More information

2018 2019 1 9 sei@mistiu-tokyoacjp http://wwwstattu-tokyoacjp/~sei/lec-jhtml 11 552 3 0 1 2 3 4 5 6 7 13 14 33 4 1 4 4 2 1 1 2 2 1 1 12 13 R?boxplot boxplotstats which does the computation?boxplotstats

More information

Lecture 8: Information Theory and Statistics

Lecture 8: Information Theory and Statistics Lecture 8: Information Theory and Statistics Part II: Hypothesis Testing and I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 23, 2015 1 / 50 I-Hsiang

More information

Review and continuation from last week Properties of MLEs

Review and continuation from last week Properties of MLEs Review and continuation from last week Properties of MLEs As we have mentioned, MLEs have a nice intuitive property, and as we have seen, they have a certain equivariance property. We will see later that

More information

Estimation Theory. as Θ = (Θ 1,Θ 2,...,Θ m ) T. An estimator

Estimation Theory. as Θ = (Θ 1,Θ 2,...,Θ m ) T. An estimator Estimation Theory Estimation theory deals with finding numerical values of interesting parameters from given set of data. We start with formulating a family of models that could describe how the data were

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

CHAPTER 2 SIMPLE LINEAR REGRESSION

CHAPTER 2 SIMPLE LINEAR REGRESSION CHAPTER 2 SIMPLE LINEAR REGRESSION 1 Examples: 1. Amherst, MA, annual mean temperatures, 1836 1997 2. Summer mean temperatures in Mount Airy (NC) and Charleston (SC), 1948 1996 Scatterplots outliers? influential

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

HT Introduction. P(X i = x i ) = e λ λ x i

HT Introduction. P(X i = x i ) = e λ λ x i MODS STATISTICS Introduction. HT 2012 Simon Myers, Department of Statistics (and The Wellcome Trust Centre for Human Genetics) myers@stats.ox.ac.uk We will be concerned with the mathematical framework

More information

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable, Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/2 01 Examination Date Time Pages Final December 2002 3 hours 6 Instructors Course Examiner Marks Y.P.

More information

Masters Comprehensive Examination Department of Statistics, University of Florida

Masters Comprehensive Examination Department of Statistics, University of Florida Masters Comprehensive Examination Department of Statistics, University of Florida May 10, 2002, 8:00am - 12:00 noon Instructions: 1. You have four hours to answer questions in this examination. 2. There

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information