f (1 0.5)/n Z =

Similar documents
M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Asymptotic Statistics-III. Changliang Zou

Math 180A. Lecture 16 Friday May 7 th. Expectation. Recall the three main probability density functions so far (1) Uniform (2) Exponential.

1. Point Estimators, Review

ELEG 3143 Probability & Stochastic Process Ch. 6 Stochastic Process

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45

Spring 2012 Math 541B Exam 1

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =

Chapter 4. Continuous Random Variables

Problem 1 (20) Log-normal. f(x) Cauchy

Evaluating the Performance of Estimators (Section 7.3)

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

Answers to the 8th problem set. f(x θ = θ 0 ) L(θ 0 )

Statistics GIDP Ph.D. Qualifying Exam Theory Jan 11, 2016, 9:00am-1:00pm

Simulation. Alberto Ceselli MSc in Computer Science Univ. of Milan. Part 4 - Statistical Analysis of Simulated Data

Exam C Solutions Spring 2005

Central Limit Theorem ( 5.3)

FINAL EXAM: Monday 8-10am

BEST TESTS. Abstract. We will discuss the Neymann-Pearson theorem and certain best test where the power function is optimized.

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that

First Year Examination Department of Statistics, University of Florida

Non-parametric Inference and Resampling

Space Telescope Science Institute statistics mini-course. October Inference I: Estimation, Confidence Intervals, and Tests of Hypotheses

Review of Probability. CS1538: Introduction to Simulations

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

ISyE 6644 Fall 2014 Test 3 Solutions

Problems ( ) 1 exp. 2. n! e λ and

MATH4427 Notebook 4 Fall Semester 2017/2018

, 0 x < 2. a. Find the probability that the text is checked out for more than half an hour but less than an hour. = (1/2)2

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Design of Engineering Experiments

Mark Scheme (Results) June 2008

Practice Problems Section Problems

EE/CpE 345. Modeling and Simulation. Fall Class 10 November 18, 2002

1. Definition: Order Statistics of a sample.

Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

STT 843 Key to Homework 1 Spring 2018

Notes 9 : Infinitely divisible and stable laws

Mathematical Statistics

Review of Probabilities and Basic Statistics

Direction: This test is worth 250 points and each problem worth points. DO ANY SIX

Random Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R

Methods of evaluating estimators and best unbiased estimators Hamid R. Rabiee

ST5215: Advanced Statistical Theory

MATH c UNIVERSITY OF LEEDS Examination for the Module MATH2715 (January 2015) STATISTICAL METHODS. Time allowed: 2 hours

Chapter 3: Unbiased Estimation Lecture 22: UMVUE and the method of using a sufficient and complete statistic

8 Laws of large numbers

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

ECON Fundamentals of Probability

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

Math 362, Problem set 1

Basic Concepts of Inference

8: Hypothesis Testing

What if There Were No Law of Large Numbers?

Week 2: Review of probability and statistics

6 The normal distribution, the central limit theorem and random samples

Review. December 4 th, Review

Practice Questions for Final

Brief Review of Probability

1* (10 pts) Let X be a random variable with P (X = 1) = P (X = 1) = 1 2

Math Review Sheet, Fall 2008

ORF 245 Fundamentals of Statistics Chapter 4 Great Expectations

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

An inferential procedure to use sample data to understand a population Procedures

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL

EE/CpE 345. Modeling and Simulation. Fall Class 9

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

Continuous Random Variables

Qualifying Exam in Probability and Statistics.

Institute of Actuaries of India

Week 9 The Central Limit Theorem and Estimation Concepts

2. Variance and Covariance: We will now derive some classic properties of variance and covariance. Assume real-valued random variables X and Y.

Chapter 2: Resampling Maarten Jansen

Math 447. Introduction to Probability and Statistics I. Fall 1998.

Lecture Notes 5 Convergence and Limit Theorems. Convergence with Probability 1. Convergence in Mean Square. Convergence in Probability, WLLN


Mathematical statistics

STAT 450: Final Examination Version 1. Richard Lockhart 16 December 2002

STAT 830 Non-parametric Inference Basics

Hypothesis testing: theory and methods

5 Introduction to the Theory of Order Statistics and Rank Statistics

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Continuous Random Variables

Non-parametric Inference and Resampling

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

p. 6-1 Continuous Random Variables p. 6-2

Properties of Random Variables

Mathematical statistics

Summary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016

STAT 430/510: Lecture 10

Math 562 Homework 1 August 29, 2006 Dr. Ron Sahoo

Financial Econometrics and Quantitative Risk Managenent Return Properties

A Very Brief Summary of Statistical Inference, and Examples

Part IB Statistics. Theorems with proof. Based on lectures by D. Spiegelhalter Notes taken by Dexter Chua. Lent 2015

IB Mathematics HL Year 2 Unit 7 (Core Topic 6: Probability and Statistics) Valuable Practice

STAT100 Elementary Statistics and Probability

Transcription:

Math 466/566 - Homework 4. We want to test a hypothesis involving a population proportion. The unknown population proportion is p. The null hypothesis is p = / and the alternative hypothesis is p > /. We want the level of the test to be., and we want the power of the test to be at least.9 when p >.55. Determine how large a sample we need and specify what the test is, i.e., when we accept the null hypothesis and when we reject it. Note that we worked out the power function for this setting of a one sided alternative in class. (It is also worked out in the book. If you want to make the problem a little more challenging you can instead take the alternative hypothesis to be that p /. 466 Solution: The statistic we use is Z = f.5.5(.5/n ( For a level of. with a one-sided test, we use P (Z >.33 =. to see that the test should be that we reject the null hypothesis if Z >.33. Over the range p.55, the power will be smallest at p =.55. So we want the power at p =.55 to be.9. When p =.55, Z is not a standard normal RV anymore. The standard normal now is Z = f.55.55(.55/n ( Note that.5(.5 =.5 while.55(.55 =.4975. So it is a good approximation to take Z to be Z = f.55.5(.5/n (3 We reject the null hypothesis if Z >.33. In terms of Z this corresponds to P (Z >.33 = P (Z >.33 + (.5.55 n = P (Z >.33. n (4 We want this to equal.9. Using tables of software we find P (Z >.8 =.9. So.33. n =.8 (6 (5

Solving for n we get n must be approximately 3. 566 Solution: The 566 version had an alternative hypothesis of p /. So to get a level of., you need c =.58, i.e., we reject the null hypothesis if Z >.58. An analysis similar to the above than shows to get the desired power we must take n to be about 49.. Problem in chapter 5 in the book. Solution: Follow the book s suggestion. The joint density is f(x, x,, x n = / when x < x < x 3 < x n, and = otherwise. We can compute the mean of U (i using this joint density: E[U (i ] = f(x, x,, x n x j dx dx dx n = ( ( x x j dx dx dx 3 dx 4 dx n First do the integral without the x j : ( ( ( ( ( ( x ( x6 x 3dx 3 3! x3 4dx 4 dx n 4! x4 5dx 5 dx n dx dx dx 3 dx 4 dx n x dx dx 3 dx 4 dx n dx 4 dx n = = Now consider what happens with the x j in there. It will go along for the ride until we do the x j integral. Then instead of x j+ x j j dx j we will have xj+ x j j dx j. So we will skip over the factor of /j and generate a factor of /(j +. So in the end we get 3 (j (j + (n + = j n + (7

Grading note: Any attempt at this problem got the full 5 points. 3. Problem in chapter 5 in the book. Solution: The Cauchy distribution does not have moments of any order. In particular the variance is infinite, so the theoretical variance of the sample mean is infinite. The variance of the sample median is approximately /(4f(m n. Here f(m = c(/s = /(πs. So σ M = π 4 49 = 5.3 So the standard deivation of the sample median is.4. Theoretically, the sample median should be a better estimator of the median than the sample mean. For the data we find the sample mean is 89.49 while the sample median is.5. The sample median is indeed much closer to the population median of. 4. Problem 3 in chapter 5 in the book. If you are using R you will find the following functions useful. pnorm( computes the c.d.f. of a normal distribution. sort( puts a sample in increasing order. (:n-.5/n will create an array with the numbers (i.5/n for i =,,, n. Solution: The null hypothesis is that the population is normal with the given mean and variance. The alternative hypothesis is that it is not. As stated in class, P (D >.36/ n =.5. So if we use a significance level of 5%, we will reject the null hypothesis if D >.36/ n =.3. Using R or your favorite software, you should find for the data set that the KS statistic D =.83. So we accept the null hypothesis that the data is normal with the given mean and variance. 5. Suppose X (the population is a continuous random variable with probability density function f(x = λ exp( λ x µ, < x < (8 where λ and µ are unknown parameters. The mean of X is µ and the density is symmetric about x = µ, so the median is also µ. We are given a random sample X, X,, X n with n large. Determine whether the sample mean or the sample median is a better estimator for µ. By better I mean has smaller variance. 3

Solution: First we compute the population variance. var(x = λ = λ (x µ e λ x µ dx = λ x e λ x dx x e λx dx = λ (9 Thus the variance of the sample mean is σ /n which is var( X n = λ n For the sample median the variance is var(m n = 4f (µn = λ n ( So the sample median is the better estimator. 6. The random variable X is uniformly distributed on the interval [, θ]. (This is the population. θ is an unknown parameter. We have a random sample X of size. We want to use it to estimate the unknown parameter θ. Consider estimators of the form T = cx where c is a constant. (a Find the value of c which makes this an unbiased estimator. Solution: We want E[T ] = θ. E[T ] = ce[x ] = cθ/. So c =. (b Find the value of c which minimizes the mean square error. This is the risk when we take the loss function to be (T θ. Solution: var(x = θ /, and the bias is cθ/ θ. So the MSE is c θ + ( cθ θ Note that we can factor out a θ, which implies that the minimizing c will not depend on θ. Minimize this as a function of c and you should get c = 3/. (c, 566 only Find the value of c which makes this an unbiased estimator. Solution: E[X (n ] = n/(n +, so c = (n + /n. (d, 566 only Find the value of c which minimizes the mean square error. This is the risk when we take the loss function to be (T θ. 4

Solution: Compute the mean square error as above using the formula from the book for the variance of X (n. You should get c = (n + (n + + n(n + 5