Masters Comprehensive Examination Department of Statistics, University of Florida

Similar documents
First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida

Masters Comprehensive Examination Department of Statistics, University of Florida

Review Quiz. 1. Prove that in a one-dimensional canonical exponential family, the complete and sufficient statistic achieves the

Problem Selected Scores

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Qualifying Exam in Probability and Statistics.

Qualifying Exam in Probability and Statistics.

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

Qualifying Exam in Probability and Statistics.

McGill University. Faculty of Science. Department of Mathematics and Statistics. Part A Examination. Statistics: Theory Paper

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Problem 1 (20) Log-normal. f(x) Cauchy

BEST TESTS. Abstract. We will discuss the Neymann-Pearson theorem and certain best test where the power function is optimized.

UNIVERSITY OF TORONTO Faculty of Arts and Science

BIOS 2083 Linear Models c Abdus S. Wahed

Class 26: review for final exam 18.05, Spring 2014

IIT JAM : MATHEMATICAL STATISTICS (MS) 2013

Probability Distributions Columns (a) through (d)

Master s Written Examination - Solution

Class 8 Review Problems 18.05, Spring 2014

t x 1 e t dt, and simplify the answer when possible (for example, when r is a positive even number). In particular, confirm that EX 4 = 3.

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =

STA 2101/442 Assignment 2 1

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

MAT 271E Probability and Statistics

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Joint Probability Distributions and Random Samples (Devore Chapter Five)

HT Introduction. P(X i = x i ) = e λ λ x i

STA 2201/442 Assignment 2

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

Math 494: Mathematical Statistics

STAT2201. Analysis of Engineering & Scientific Data. Unit 3

UC Berkeley Department of Electrical Engineering and Computer Sciences. EECS 126: Probability and Random Processes

Contents 1. Contents

Simple Linear Regression

Association studies and regression

Spring 2012 Math 541B Exam 1

Stat 5102 Final Exam May 14, 2015

Midterm Exam 1 Solution

18.05 Practice Final Exam

Statistics & Data Sciences: First Year Prelim Exam May 2018

Time Series Analysis

Statistics 135 Fall 2008 Final Exam

MAS223 Statistical Inference and Modelling Exercises

Comprehensive Examination Quantitative Methods Spring, 2018

[y i α βx i ] 2 (2) Q = i=1

Chapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

1. (Regular) Exponential Family

18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages

Master s Written Examination

Ch 2: Simple Linear Regression

A Very Brief Summary of Statistical Inference, and Examples

MAT 271E Probability and Statistics

MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL

Probability Theory and Statistics. Peter Jochumzen

General Linear Model: Statistical Inference

7 Random samples and sampling distributions

A Very Brief Summary of Statistical Inference, and Examples

Mathematical statistics

Master s Written Examination

INSTITUTE OF ACTUARIES OF INDIA

Asymptotic Statistics-III. Changliang Zou

Mathematical statistics

Estimation theory. Parametric estimation. Properties of estimators. Minimum variance estimator. Cramer-Rao bound. Maximum likelihood estimators

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Analysis of Engineering and Scientific Data. Semester

Exam Applied Statistical Regression. Good Luck!

Section 4.6 Simple Linear Regression

Statistics Ph.D. Qualifying Exam: Part II November 3, 2001

HW1 (due 10/6/05): (from textbook) 1.2.3, 1.2.9, , , (extra credit) A fashionable country club has 100 members, 30 of whom are

UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Applied Statistics Friday, January 15, 2016

Chapter 4. Chapter 4 sections

Review. December 4 th, Review

What is Experimental Design?

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56

STAT 450: Final Examination Version 1. Richard Lockhart 16 December 2002

with the usual assumptions about the error term. The two values of X 1 X 2 0 1

Test Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics

Reminders. Thought questions should be submitted on eclass. Please list the section related to the thought question

Statistical Distributions and Uncertainty Analysis. QMRA Institute Patrick Gurian

Lecture 15. Hypothesis testing in the linear model

Part 6: Multivariate Normal and Linear Models

COMPARING SEVERAL MEANS: ANOVA

This exam is closed book and closed notes. (You will have access to a copy of the Table of Common Distributions given in the back of the text.

Multiple Linear Regression

Regression Estimation Least Squares and Maximum Likelihood

PhD Qualifying Examination Department of Statistics, University of Florida

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

WISE International Masters

STAT5044: Regression and Anova. Inyoung Kim

Rockefeller College University at Albany

Chapter 7. Hypothesis Testing

Probability on a Riemannian Manifold

Transcription:

Masters Comprehensive Examination Department of Statistics, University of Florida May 6, 003, 8:00 am - :00 noon Instructions: You have four hours to answer questions in this examination You must show your work to receive credit 3 There are 0 problems of which you must answer 8 4 Only your first 8 problems will be graded 5 The last page contains two tables that you may need 6 While the 0 questions are equally weighted, some problems are more difficult than others 7 The parts within a given question are not necessarily equally weighted 8 Write only on one side of the paper, and start each question on a new page 9 You are allowed to use a calculator The following abbreviations are used throughout: ANOVA = analysis of variance CDF = cumulative distribution function CRD = completely randomized design iid = independent and identically distributed LRT = likelihood ratio test mgf = moment generating function ML = maximum likelihood pdf = probability density function pmf = probability mass function UMVUE = uniformly minimum variance unbiased estimator ε i NID(0, σ ) means that the ε i s are iid N(0, σ )

Consider the CRD, stated as the cell means model with 3 fixed treatments, 5 replicates per treatment, and the following error structure: y ij = µ i + ε ij ε ij NID(0, σ ) i =,, 3; j =,, 5 (a) Give the Treatment and Error sums of squares in symbolic form and derive the expected mean square for treatments Hint: (x i x ) = x i nx Now suppose the sample means (standard deviations) for the three treatments are: 50 (3), 60 (), 40 (4) (b) Obtain the Analysis of Variance (c) Test H 0 : µ = µ = µ 3 vs H A : Population means are not all equal at α = 005 significance level (d) Use Bonferroni s method with an experiment-wise error rate of α = 005 to compare all pairs of treatments An internet based firm has sources of advertising: newspaper/magazines and television/radio They vary the amounts of advertising of each type (X and X, in thousands of dollars, respectively) across a sample of n = 8 similar sized markets, and obtain how many times their website is reached by people from each market (Y, in 000s of hits ) They fit the multiple regression model: Y i = β 0 + β X i + β X i + ε i ε i NID(0, σ ) and obtain the following estimated regression equation, residual mean square, and (X X) matrix: Ŷ i = 8 + 45X i + 35X i s = 87 ( X X ) = 04 0 0 0 0096 006 0 006 0096 (a) Give the predicted number of hits if they spend $000 on newspaper/magazines and $000 on television/radio (be careful of units) (b) Test whether the mean number of hits when X = 0 and X = 0 is equal to 00 Set this up and conduct it as a test of the form: H 0 : K β = m and test at α = 005 significance level (c) Suppose they fit a response surface and obtained the following least squares estimate of the regression equation containing an intercept, all linear and quadratic terms, and a cross-product term Ŷ = 05 + 60X + 40X 0X 05X + 0X X Their budget is limited to $5000 (recall units), and they must spend all of it How should they allocate their budget between newspaper/magazine ads (X ) and television/radio advertising (X ) to maximize the predicted number of hits based on the estimated regression equation

3 A clothing manufacturer buys three types of yarn They are interested in comparing the three types of yarn in terms of mean breaking strength They obtain random samples of each type from two suppliers and observe the sample mean breaking strengths (in psi) based on 0 replicates per yarn type from each supplier A summary of the data follows Yarn Type Supplier 3 Mean 350 550 750 550 450 450 050 650 Mean 400 500 900 600 (a) Obtain the Analysis of Variance, including the sources: yarn types, suppliers, interaction, and error ( (yij y ) = 50, 000) (b) Assuming that these are the only two suppliers for the manufacturer, test for (i) interaction, (ii) yarn type effects, and (iii) supplier effects by adding test statistics and critical values to your ANOVA table (c) Repeat the previous exercise under the assumption that these two suppliers are a random sample from a large number of suppliers 4 In a study of the effect of Viagra, patients suffering from erectile dysfunction were randomly assigned to one of four doses of Viagra (in mg) The following table gives mean rating scores of erectile function after 4 weeks of treatment, as well as sample sizes (adjusted slightly for ease of computation) Consider two different models for these data: Dose (X i ) Y i r i 0 (Placebo) 00 5 3 00 50 35 00 00 40 00 Model (a): Y ij = β 0 + X i β + ε ij i =,, n; j =,, r i, Model (b): Y i = β 0 + X i β + ε i i =,, n, where ε ij are uncorrelated random variables with mean 0 and variance σ (a) The weighted least squares estimator of (β 0, β ) for Model (b) is the same as the ordinary least squares estimator for Model (a) Write out the matrix form of ˆβ w for Model (b) with numbers (no symbols) that you would enter into a matrix language to get the estimate You need not multiply or invert any of the matrices as long as they are stated properly with correct numbers in the proper form The actual weighted least squares estimate is: ˆβ w = (40, 00) Further, the variances for the 4 dose groups are: 78, 40, 40, and 40, respectively (b) Give the fitted values for subjects in the 4 dose groups based on the weighted least squares estimator of β (c) Conduct the F-test for lack-of-fit to test whether the true relationship is linear at the α = 005 significance level 3

5 In scalar form for the simple linear regression model: Y i = β 0 + β X i + ε i ε i NID(0, σ ), the least squares estimates of β and β 0 are the commonly known values: ˆβ = i= (X i X)(Y i Y ) i= (X i X) ˆβ0 = Y ˆβ X (a) Derive the mean and variance of the sampling distribution of ˆβ (b) Derive the mean of the sampling distribution of ˆβ 0 6 This question deals with moment generating functions and applications to normal random variables (a) Show that if X N(µ, σ ) then Z = (X µ)/σ N(0, ) (b) Let W be a random variable with mgf M W (t) and suppose that a and b are constants Show that the mgf of aw + b can be written as e bt M W (at) (c) Show that the mgf of X is M X (t) = exp {µt + σ t } (d) Derive the distribution of Y = Z (e) Are Y and Z uncorrelated? (f) Are Y and Z independent? (This requires more than a yes or no ) 7 Toss n fair dice Pick up only those not showing a 6 and toss them Continue doing this until all dice show a 6 Let X denote the total number of tosses made in this experiment (a) How many tosses do you expect to make if n =? (b) Compute the CDF of X for general n (Hint: Think about what happens with each individual die) (c) Suppose that Y is a discrete random variable whose support is contained in {0,,, } Show that [ FY (y) ] = EY, y=0 where F Y ( ) is the CDF of Y (Hint: Write the left-hand side as a double sum) (d) Use the result from (c) to find the expected value of X when n = 4

8 Let X, X,, X n be iid with common pdf where θ > 0 f(x θ) = { θx θ 0 < x < 0 otherwise (a) Show that the ML estimator of θ is δ(x,, X n ) = n i= log X i (b) Calculate the expected value and the variance of δ(x,, X n ) (c) Find the Cramér-Rao lower bound for the variance of an unbiased estimator of θ What can you conclude? (d) Find a pivotal quantity that depends on the data only through δ (e) Suppose that n = and that α (0, ) Use the result in (d) to construct an interval estimator for θ with confidence coefficient α Give explicit formulas for the endpoints that involve only X and α 9 Suppose that X,, X n are iid N(0, σ ) and that Y,, Y m are iid N(0, τ ) Assume further that X = (X,, X n ) and Y = (Y,, Y m ) are independent Define λ = σ /τ (a) Find the ML estimator of σ (b) Construct the LRT statistic for testing H 0 : λ = λ 0 against H : λ λ 0 where λ 0 > 0 (c) Show that the LRT statistic can be written in such a way that it involves the data only through the statistic m i= Y i (d) Find the distribution of F under H 0 F = m nλ 0 i= X i (e) The general LRT theory tells us to reject H 0 when the LRT statistic is small Give an equivalent rejection rule in terms of F 0 This question deals with the concept of completeness and its applications (a) Consider a family of pdfs (or pmfs) { f(t θ) : θ Θ } Explain exactly what it means to say that this family is complete (b) Consider a binomial family where the number of trials, n, is a fixed positive integer and the success probability, θ, is restricted to the set { } Θ = 4,, 3 4 Is this family complete? (Remember to show your work!) (c) Let X,, X k be iid Binomial(n, θ) where n is a fixed, known positive integer and θ (0, ) is unknown Find the best unbiased estimator of nθ( θ) n ; that is, the UMVUE of nθ( θ) n 5