AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

Similar documents
Topic 9: Sampling Distributions of Estimators

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Topic 9: Sampling Distributions of Estimators

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

Estimation for Complete Data

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

Topic 9: Sampling Distributions of Estimators

Introducing Sample Proportions

Review Questions, Chapters 8, 9. f(y) = 0, elsewhere. F (y) = f Y(1) = n ( e y/θ) n 1 1 θ e y/θ = n θ e yn

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

MATH 472 / SPRING 2013 ASSIGNMENT 2: DUE FEBRUARY 4 FINALIZED

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Frequentist Inference

Introducing Sample Proportions

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Bayesian Methods: Introduction to Multi-parameter Models

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

Sampling Distributions, Z-Tests, Power

AMS570 Lecture Notes #2

Expectation and Variance of a random variable

Estimation of a population proportion March 23,

Lecture 9: September 19

Statistical Theory MT 2008 Problems 1: Solution sketches

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

(7 One- and Two-Sample Estimation Problem )

Exponential Families and Bayesian Inference

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

Statistics 300: Elementary Statistics

Statistical Theory MT 2009 Problems 1: Solution sketches

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

IE 230 Seat # Name < KEY > Please read these directions. Closed book and notes. 60 minutes.

Lecture 2: Monte Carlo Simulation

Stat 400: Georgios Fellouris Homework 5 Due: Friday 24 th, 2017

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.


Questions and Answers on Maximum Likelihood

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Binomial Distribution

7.1 Convergence of sequences of random variables

Kurskod: TAMS11 Provkod: TENB 21 March 2015, 14:00-18:00. English Version (no Swedish Version)

Chapter 8: Estimating with Confidence

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

AP Statistics Review Ch. 8

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Measures of Spread: Standard Deviation

M06/5/MATHL/HP2/ENG/TZ0/XX MATHEMATICS HIGHER LEVEL PAPER 2. Thursday 4 May 2006 (morning) 2 hours INSTRUCTIONS TO CANDIDATES

GUIDELINES ON REPRESENTATIVE SAMPLING

Random Signals and Noise Winter Semester 2017 Problem Set 12 Wiener Filter Continuation

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!

P1 Chapter 8 :: Binomial Expansion

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Parameter, Statistic and Random Samples

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

This is an introductory course in Analysis of Variance and Design of Experiments.

Math 10A final exam, December 16, 2016

ECO 312 Fall 2013 Chris Sims LIKELIHOOD, POSTERIORS, DIAGNOSING NON-NORMALITY

Solutions to Odd Numbered End of Chapter Exercises: Chapter 4

STP 226 EXAMPLE EXAM #1

Data Analysis and Statistical Methods Statistics 651

7.1 Convergence of sequences of random variables

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

LECTURE NOTES 9. 1 Point Estimation. 1.1 The Method of Moments

IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.

Stat 200 -Testing Summary Page 1

Confidence Intervals for the Population Proportion p

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

Probability and MLE.

Element sampling: Part 2

x = Pr ( X (n) βx ) =

Chapter 23: Inferences About Means

Lecture 7: Properties of Random Samples

1 Models for Matched Pairs

Axis Aligned Ellipsoid

Understanding Samples

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

1 Inferential Methods for Correlation and Regression Analysis

2.4 Sequences, Sequences of Sets

PRACTICE PROBLEMS FOR THE FINAL

Chapter 6 Principles of Data Reduction

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

MATH/STAT 352: Lecture 15

Topic 10: The Law of Large Numbers

Central Limit Theorem the Meaning and the Usage

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

6. Sufficient, Complete, and Ancillary Statistics

Transcription:

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS SPRING 2015 / INSTRUCTOR: KLAUS MOELTNER This exam is ope-book, ope-otes, but please work strictly o your ow. Please make sure your ame is o every sheet you re hadig i. You have 120 miutes to complete this exam. You ca collect a maximum of 50 poits. Each questio is scored as idicated below. Vectors are give i lower-case boldface. Matrices are writte i upper-case boldface. Questio I 20 poits): Estimatig a Populatio Proportio via Maximum Likelihood You are researchig groudwater cotamiatio i a commuity of homes, all of which receive their water supply through a private well. You have a sample of wells, y of which are cotamiated y, of course). Your parameter of iterest is θ, the proportio of cotamiated wells i the etire commuity. You believe the sample likelihood follows a Biomial distributio, give as L θ) = θ y 1 θ) y, θ [0, 1], y = 1, 2,..., where! =!y! 1) I this otatio, is the umber of trials the umber of wells i your sample), ad y is the umber of successes the umber of cotamiated wells i your sample). The expectatio ad variace of the biomial are give, respectively, as E y θ, ) = θ, ad V y θ, ) = θ 1 θ). Part a) 14 poits Derive the followig aalytical costructs: 1) The log-likelihood fuctio ll θ) 2 poits) 2) The gradiet g θ) 2 poits) 3) The Hessia H θ) 2 poits) 4) The Iformatio matrix I θ) 2 poits) 5) Show that the score idetity holds. 2 poits) 6) Show that the iformatio idetity holds. 4 poits) Date: May 12, 2015. 1

ll θ) = l + y l θ) + l 1 θ) g θ) = yθ 1 1 θ) 1 = θ 1 θ)) 1 y 1 θ) 1 H θ) = yθ 2 1 θ) 2 I θ) = E H θ)) = use E y θ, ) = θ for y i H θ) ) θ 1 θ) E g θ)) = 0 use E y θ, ) = θ for y i g θ) ) V g θ)) = θ 1 θ)) 2 V y θ, ) = θ 1 θ)) 2 θ 1 θ) = θ 1 θ) = I θ), usig V y θ, ) = θ 1 θ) i the secod expressio of g θ) ). 2) Part b) 6 poits Assume you have a sample of 100 wells, 20 of which are cotamiated. Fid the MLE estimate for θ ad its stadard error. Settig the gradiet from above to zero ad solvig for θ we obtai ˆθ = y, which, for this sample, equals 0.2. The variace of θ is the iverse of the iformatio matrix: V θ) = θ1 θ). So i this case we obtai ˆV ˆθ) = 0.0016. The square root of this gives the stadard error of 0.04. 2

Questio II 30 poits): Estimatig a Populatio Proportio via Bayesia Methods Cotiuig with the problem from above, you have obtaied iformatio from other commuities that are located o the same groudwater aquifer. For these locatios, the average proportio of cotamiated wells is 0.25 with a stadard deviatio of 0.14. You use this iformatio to specify a prior distributio for θ as a Beta desity with shape parameters α 0 = 2 ad β 0 = 6. This desity is give as: p θ) = Γ α 0 + β 0 ) Γ α 0 ) Γ β 0 ) θα 0 1 1 θ) β 0 1 θ [0, 1], 3) where Γ.) deotes the mathematical gamma fuctio. The expectatio ad variace of the Beta are give, respectively, as E θ) = α 0 α 0 +β 0, ad V θ) = α 0 β 0 α 0 +β 0 ) 2 α 0 +β 0 +1). You ca easily verify but you do t have to...) that for the give prior shape parameters α 0 = 2 ad β 0 = 6 we obtai a prior expectatio for θ of 0.25 ad a prior stadard deviatio of approx.) 0.14. Alog with the atural bouds of the Beta at 0 ad 1, this seems ideed like a reasoable prior distributio for θ. Part a) 8 poits Usig the same biomial likelihood as i Q.1 that is, the u-logged versio), fid the posterior distributio of θ, call it p θ y, ). Show that it is agai a Beta with shape parameters α 1 ad β 1, ad show that these posterior parameters are a fuctio of the prior parameters ad the data. Hit: Multiply all relevat parts of the prior with all relevat parts of the likelihood to obtai the posterior kerel. Simplify, ad recogize the resultig kerel as the kerel of aother Beta distributio with parameters α 1 ad β 1. The show the explicit form of α 1 ad β 1. This should take o more tha a miute or two...) p θ y, ) θ α 0 1 1 θ) β 0 1 θ y 1 θ) y = θ α 0+y 1) 1 θ) β 0+ y 1) 4) Thus, θ y, Beta α 1, β 1 ), with α 1 = α 0 + y, ad β 1 = β 0 + y. Part b) 6 poits For your sample of 100 wells with 20 cotamiatio cases, compute the posterior mea ad stadard deviatio of θ. Please be precise to the fourth decimal. First ote that for the give data ad prior parameters, we have α 1 = α 0 + y = 22, ad β 1 = β 0 + y = 86. We ca ow use these values i the give expressios for the mea ad variace 3

of a Beta radom variable: E θ y, ) = α 1 α 1 + β 1 = 0.2037 V θ y, ) = α 1 β 5) 1 α 1 + β 1 ) 2 α 1 + β 1 + 1) = 0.0015 This yields a posterior stadard deviatio of sd θ y, ) = sqrt 0.0015) = 0.0386. Part c) 6 poits Aswer the followig questios: 1) Compare the MLE estimate ad the posterior mea of θ. Why are they at least slightl differet? 2 poits) 2) Has the collected data brought meaigful iformatio to add to the prior? How ca you tell? 2 poits) 3) For the same collected data, how would you expect your posterior mea to chage up, dow, or o chage) if the prior mea of θ had bee 0.4? Explai. 2 poits) 1) The posterior mea is slightly higher due to the ifluece of the prior distributio, which has a higher mea of 0.25. The sample size is ot large eough to completely overpower the prior. 2) Yes - the posterior stadard deviatio is much smaller tha the prior stadard deviatio. So the iformatio from the data has tighteed up the prior distributio. 3) We would expect the posterior mea to icrease, due to the ifluece of the ew prior. Part d) 10 poits Now cosider istead a sample of oly 10 wells, with 2 cotamiatios ad the origial priors from above). Compute the ew MLE estimate ad its stadard error, ad the ew posterior mea ad stadard deviatio. 4 poits) 1) How has the posterior mea chaged compared to its previous value? Why? 2 poits) 2) Why has the posterior mea chaged ad the MLE estimate ot? 2 poits) 3) Compare the ew MLE stadard error ad the ew posterior stadard deviatio. Which oe has icreased more i both absolute ad relative terms), ad why? 2 poits) Usig) all the ew sample iformatio ad the aalytical results from above, we obtai ˆθ = 0.2, s.e ˆ ˆθ = 0.1265, α 1 = 4, β 1 = 14, E θ y, ) = 0.2222, ad sd θ, = 0.0945. 1) The posterior mea has icreased - this is because the prior ifluece is ow stroger tha before due to the smaller sample size. 2) Same aswer as above - the posterior mea is uder the ifluece of the prior, which is completely abset for the MLE estimate. 4

3) The MLE stadard error has icreased much more. I cotrast, the precisio of the posterior desity has suffered less from the decreased sample size. This is due to the relatively high precisio brought to the model by the prior distributio. We have a iformed prior). 5