STT 843 Key to Homework 1 Spring 2018

Similar documents
Multivariate Analysis Homework 1

1. Density and properties Brief outline 2. Sampling from multivariate normal and MLE 3. Sampling distribution and large sample behavior of X and S 4.

Exam 2. Jeremy Morris. March 23, 2006

Linear models and their mathematical foundations: Simple linear regression

You can compute the maximum likelihood estimate for the correlation

Lecture 11. Multivariate Normal theory

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.

Asymptotic Statistics-III. Changliang Zou

Simple Linear Regression

Math 181B Homework 1 Solution

STA 2201/442 Assignment 2

EE/CpE 345. Modeling and Simulation. Fall Class 10 November 18, 2002

STAT 501 EXAM I NAME Spring 1999

TMA 4275 Lifetime Analysis June 2004 Solution

Hypothesis Testing One Sample Tests

MATH5745 Multivariate Methods Lecture 07

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

Homework 2: Simple Linear Regression

Modeling and Performance Analysis with Discrete-Event Simulation

Notes on the Multivariate Normal and Related Topics

EE/CpE 345. Modeling and Simulation. Fall Class 9

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

STA 4322 Exam I Name: Introduction to Statistics Theory

Asymptotic Statistics-VI. Changliang Zou

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is

Random Variables and Their Distributions

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Multivariate Statistical Analysis

4 Statistics of Normally Distributed Data

This does not cover everything on the final. Look at the posted practice problems for other topics.

Probability Theory and Statistics. Peter Jochumzen

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Introduction to Normal Distribution

Chapter 5 continued. Chapter 5 sections

Module 5 Practice problem and Homework answers

Applied Multivariate and Longitudinal Data Analysis

Inferences about a Mean Vector

Random Matrices and Multivariate Statistical Analysis

Lecture 3. Inference about multivariate normal distribution

Exercises and Answers to Chapter 1

Can you tell the relationship between students SAT scores and their college grades?

Stat 216 Final Solutions

Random vectors X 1 X 2. Recall that a random vector X = is made up of, say, k. X k. random variables.

You may not use your books/notes on this exam. You may use calculator.

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

A Bivariate Weibull Regression Model

Canonical Correlation Analysis of Longitudinal Data

Linear Models and Estimation by Least Squares

Stat 206: the Multivariate Normal distribution

STAT 4385 Topic 01: Introduction & Review

Institute of Actuaries of India

F & B Approaches to a simple model

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Statistical Inference

THE UNIVERSITY OF CHICAGO Graduate School of Business Business 41912, Spring Quarter 2008, Mr. Ruey S. Tsay. Solutions to Final Exam

[y i α βx i ] 2 (2) Q = i=1

Lesson 8: Testing for IID Hypothesis with the correlogram

ECON Fundamentals of Probability

STA 2101/442 Assignment 3 1

Multiple Random Variables

Bivariate Relationships Between Variables

Multivariate Random Variable

Summary of Chapters 7-9

Ch 2: Simple Linear Regression

Psychology 282 Lecture #4 Outline Inferences in SLR

Latent variable interactions

Random Vectors and Multivariate Normal Distributions

Accounting for Population Uncertainty in Covariance Structure Analysis

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Introduction to statistics. Laurent Eyer Maria Suveges, Marie Heim-Vögtlin SNSF grant

Introduction to Computational Finance and Financial Econometrics Matrix Algebra Review

Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):

Inferences for Correlation

Multivariate Regression

If we want to analyze experimental or simulated data we might encounter the following tasks:

Principal Component Analysis for a Spiked Covariance Model with Largest Eigenvalues of the Same Asymptotic Order of Magnitude

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

Comment about AR spectral estimation Usually an estimate is produced by computing the AR theoretical spectrum at (ˆφ, ˆσ 2 ). With our Monte Carlo

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Final Examination Statistics 200C. T. Ferguson June 11, 2009

Statistics Introductory Correlation

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Estadística II Chapter 4: Simple linear regression

STP 226 ELEMENTARY STATISTICS NOTES

f (1 0.5)/n Z =

MAS223 Statistical Inference and Modelling Exercises

Preliminary Statistics. Lecture 3: Probability Models and Distributions

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

BIOS 2083 Linear Models Abdus S. Wahed. Chapter 2 84

Computational functional genomics

Introduction to Computational Finance and Financial Econometrics Probability Review - Part 2

Chapter 7 Comparison of two independent samples

Multivariate Distributions

The Chi-Square Distributions

Transcription:

STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ = ( 2 2/2 2/2 Then it follows that Σ = 3/2, Σ /2 = 3/2 and Σ = 2 ( 2/2 3 2/2 2 As a result, the bivariate normal density is f(x = (2π 3/2 exp { ( 2 (x, x 2 2Σ x } x 2 2 (b From part (a, we see that = 6π exp { 3 (x2 2x (x 2 2 + 2(x 2 2 2 } (x µ T Σ (x µ = 2 3 (x2 2x (x 2 2 + 2(x 2 2 2 (c The constant density contours satisfies the equation: (x µ T Σ (x µ = c The area that this contour contains is described by R(c = {(x, x 2 : (x µ T Σ (x µ c} To find the constant density contours that contains 50% probability, we need to determine c such that P {X R(c} = 05 This is equivalent to find c such that P {(X, X 2 : (X µ T Σ (X µ c} = 05 Because (X µ T Σ (X µ follows a chi-square distribution with degrees of freedom 2, the constant c should be the 50% quantile of the chi-square distribution That is c = χ 2 2,05 Thus, the contour that contains 50% probability is (x µ T Σ (x µ = χ 2 2,05 = 386294

x 2 0 2 3 4-2 - 0 2 x Figure : The density contour plot that contains 50% probability 44 (a Let c = (3, 2, Then, 3X 2X 2 + X 3 = c X is linear combination of normally distributed random vector Thus, c X has a normal distribution with mean c µ and variance c Σc Here c µ = 3 2+( 2 ( 3+ = 3, and c Σc = (3, 2, 3 2 2 2 3 2 = 9 (b Let a = (a, a 2 Because X 2 and X 2 a X a 2 X 3 are both normally distributed, X 2 and X 2 a X a 2 X 3 are independent if the covariance between X 2 and X 2 a X a 2 X 3 is 0 Then, Cov(X 2, X 2 a X a 2 X 3 = σ 22 a σ 2 a 2 σ 23 = 3 a 2a 2 = 0 Any vector a = (a, a 2 satisfying condition 3 a 2a 2 = 0 ensures the independence between X 2 and X 2 a X a 2 X 3 46 (a Because σ 2 = 0, X and X 2 are independent (b Because σ 3 =, X and X 3 are not independent (c Because σ 23 = 0, X 2 and X 3 are independent (d Because (σ 2, σ 32 = (0, 0, (X, X 3 and X 2 are independent (e Because Cov(X, X + 3X 2 2X 3 = σ + 3σ 2 2σ 3 = 4 2( = 6, X and X + 3X 2 2X 3 are not independent 2

47 (a The joint ( distribution of (X, X 3 is normal with mean (, 2 and covariance By Result 46 in the textbook, we know that X 4 2 given X 3 = x 3 is normally distributed with mean µ + σ 3 σ33 (x 3 µ 3 = + ( 2 (x 3 2 = 05x 3 + 2 and covariance σ σ 3 σ33 σ 3 = 4 ( 2 /2 = 35 (b The conditional distribution of X given X 2 = x 2, X 3 = x 3 is normal with mean ( ( 5 0 x2 + + (0, = 05x 0 2 3 + x 3 2 and covariance ( 5 0 σ (0, 0 2 ( 0 = 35 46 (a Because V and V 2 are linear combinations of X,, X 4, V and V 2 both have multivariate normal distribution Therefore, we only need to determine the means and covariances of V and V 2 For V, the mean is and covariance is E(V = 4 E(X 4 E(X 2 + 4 E(X 3 4 E(X 4 = 0 Cov(V = 6 Cov(X + 6 Cov(X 2 + 6 Cov(X 3 + 6 Cov(X 4 = 4 Σ Therefore, V is normally distributed with mean 0 and covariance 4 Σ For V 2, the mean is and covariance is E(V 2 = 4 E(X + 4 E(X 2 4 E(X 3 4 E(X 4 = 0 Cov(V 2 = 6 Cov(X + 6 Cov(X 2 + 6 Cov(X 3 + 6 Cov(X 4 = 4 Σ Therefore, V 2 is normally distributed with mean 0 and covariance 4 Σ (b Because (V, V 2 are linear combinations of X,, X 4, (V, V 2 is multivariate normally distributed From part (a, the mean is 0 The covariance between V and V 2 is Cov(V, V 2 = 6 Cov(X 6 Cov(X 2 6 Cov(X 3 + 6 Cov(X 4 = 0 3

Therefore, the joint distribution of (V, V covariance ( 4 Σ 0 0 4 Σ As a result, the joint density of (V, V 2 is 2 is normal with mean 0 and f(v, V 2 = (2π p 4 Σ exp{ 2 V ( 4 Σ V 2 V 2( 4 Σ V 2 } 48 The maximum likelihood estimator for µ is X n = 4 4 X i = i= The maximum likelihood estimator for Σ is ( 4 6 ˆΣ = 4 X X X n X n = ( /2 /4 /4 3/2 49 (a The distribution of (X µ Σ (X µ is chi-square distribution with degrees of freedom 6 (b The distribution of X is normal with mean µ and covariance Σ/20 The distribution of n( X µ is normal with mean 0 and covariance Σ (c The distribution of (n S is Wishart distribution with degrees of freedom 9 and covariance Σ, namely W 6 (9, Σ 42 (a The distribution of X is normal with mean µ and covariance Σ/60 (b The distribution of (X µ Σ (X µ is chi-square distribution with degrees of freedom 4 (c The distribution of n( X µ Σ ( X µ is chi-square distribution with degrees of freedom 4 (c The approximate distribution of n( X µ S ( X µ is chi-square distribution with degrees of freedom 4 423 (a The QQ plot is given in Figure 2 Based on the QQ plot, it seems that most data points are along the straight line This might indicate that normal distribution is appropriate for this data set However, due to the small sample size, the sample quantile estimated based on 0 data points might not be very informative Therefore, the conclusion based on the QQ plot might not be reliable for this data set (b Using the formula given in equation (4-3 on page 8, the correlation coefficient r Q is defined by n j= r Q = (x (j x(q (j q n j= (x n (j x 2 j= (q (j q 2 4

Normal Q-Q Plot Sample Quantiles -0 0 0 20-5 -0-05 00 05 0 5 Theoretical Quantiles Figure 2: Q-Q plot of the annual rates of return given in question 423 The above coefficient r Q is 0947045 The critical value for the test of normality can be found in Table 42 For our case, the sample size is 0 and significant level is 0%, the critical value is 0935 This means that r Q is larger than the corresponding critical value Thus, we fail to reject the null hypothesis and conclude that the normality assumption might be appropriate for this data set We still need to be cautious about this conclusion because the data might not be independent Therefore, the critical value used in our test might not be appropriate 426 (a Using R, we obtain the mean x = (5200, 248 and sample covariance as ( 062222 77022 77022 3085437 Then the squared distances are 8753045 20203262 29009088 07352659 030592 007662 3732902 086540 3753379 4252799 (b Comparing the squared distances with the 50% quantile of chi-square distribution with degrees of freedom 2 (ie, 386294, we find that the 4-th, 5-th, 6-th, 8-th and 9-th data points fall within the estimated 50% probability contour of a bivariate normal distribution The proportion of the observations falling within the contour is 50% (c A chi-square plot is given in the following Figure 4 5

x 2 5 0 5 20 0 2 4 6 8 0 2 x Figure 3: The density contour plot that contains 50% probability and scatter plot of data points Chi-square plot for χ 2 2 djs 0 2 3 4 0 2 4 6 8 0 2 4 qchisq(ppoints(500, df = 2 Figure 4: The chi-square plot for checking normality 6

(d We observe from the chi-square plot given in part (c, most data points except the 0-th data point are along the theoretical line Therefore, these data are approximately bivariate normal 7