ECE 6540, Lecture 06 Sufficient Statistics & Complete Statistics Variations

Similar documents
Variations. ECE 6540, Lecture 02 Multivariate Random Variables & Linear Algebra

Variations. ECE 6540, Lecture 10 Maximum Likelihood Estimation

Continuous Random Variables

Time Domain Analysis of Linear Systems Ch2. University of Central Oklahoma Dr. Mohamed Bingabr

(1) Correspondence of the density matrix to traditional method

CHAPTER 5 Wave Properties of Matter and Quantum Mechanics I

EE 341 A Few Review Problems Fall 2018

Angular Momentum, Electromagnetic Waves

EE120 Fall 2016 HOMEWORK 3 Solutions. Problem 1 Solution. 1.a. GSI: Phillip Sandborn

General Strong Polarization

Lecture 3. STAT161/261 Introduction to Pattern Recognition and Machine Learning Spring 2018 Prof. Allie Fletcher

Wave Motion. Chapter 14 of Essential University Physics, Richard Wolfson, 3 rd Edition

General Strong Polarization

Charge carrier density in metals and semiconductors

Chapter 5: Spectral Domain From: The Handbook of Spatial Statistics. Dr. Montserrat Fuentes and Dr. Brian Reich Prepared by: Amanda Bell

(1) Introduction: a new basis set

Math 171 Spring 2017 Final Exam. Problem Worth

Module 7 (Lecture 27) RETAINING WALLS

Rotational Motion. Chapter 10 of Essential University Physics, Richard Wolfson, 3 rd Edition

Chemical Engineering 412

A Step Towards the Cognitive Radar: Target Detection under Nonstationary Clutter

Gradient expansion formalism for generic spin torques

Work, Energy, and Power. Chapter 6 of Essential University Physics, Richard Wolfson, 3 rd Edition

Radial Basis Function (RBF) Networks

1. (Regular) Exponential Family

2.4 Error Analysis for Iterative Methods

CDS 101/110: Lecture 6.2 Transfer Functions

Lecture 3 Optimization methods for econometrics models

Estimate by the L 2 Norm of a Parameter Poisson Intensity Discontinuous

Prof. Dr.-Ing. Armin Dekorsy Department of Communications Engineering. Stochastic Processes and Linear Algebra Recap Slides

5.6 Multistep Methods

EECE 2150 Circuits and Signals Final Exam Fall 2016 Dec 9

Introduction to Density Estimation and Anomaly Detection. Tom Dietterich

PHY103A: Lecture # 4

SVARs & identification, and advanced time series (continuous time processes and the frequency domain)

Support Vector Machines. CSE 4309 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington

CHAPTER 4 Structure of the Atom

Yang-Hwan Ahn Based on arxiv:

Kalman Filters for Mapping and Localization

BIOE 198MI Biomedical Data Analysis. Spring Semester Lab 4: Introduction to Probability and Random Data

7.3 The Jacobi and Gauss-Seidel Iterative Methods

PHY103A: Lecture # 9

Extreme value statistics: from one dimension to many. Lecture 1: one dimension Lecture 2: many dimensions

Unit WorkBook 4 Level 4 ENG U8 Mechanical Principles 2018 UniCourse Ltd. All Rights Reserved. Sample

General Strong Polarization

Non-Parametric Weighted Tests for Change in Distribution Function

Fluids in Rigid-Body Motion

Elastic light scattering

10.4 Controller synthesis using discrete-time model Example: comparison of various controllers

A new procedure for sensitivity testing with two stress factors

Last Name _Piatoles_ Given Name Americo ID Number

Radiation. Lecture40: Electromagnetic Theory. Professor D. K. Ghosh, Physics Department, I.I.T., Bombay

Lecture No. 1 Introduction to Method of Weighted Residuals. Solve the differential equation L (u) = p(x) in V where L is a differential operator

General Strong Polarization

Dressing up for length gauge: Aspects of a debate in quantum optics

CHAPTER 2 Special Theory of Relativity

Estimation and Detection

Mathematics Ext 2. HSC 2014 Solutions. Suite 403, 410 Elizabeth St, Surry Hills NSW 2010 keystoneeducation.com.

On The Cauchy Problem For Some Parabolic Fractional Partial Differential Equations With Time Delays

Diffusive DE & DM Diffusve DE and DM. Eduardo Guendelman With my student David Benisty, Joseph Katz Memorial Conference, May 23, 2017

Lecture 3. Linear Regression

Q. 1 Q. 25 carry one mark each.

Problem 4.1 (Verdeyen Problem #8.7) (a) From (7.4.7), simulated emission cross section is defined as following.

International Journal of Scientific & Engineering Research, Volume 7, Issue 8, August ISSN MM Double Exponential Distribution

Charged-Particle Interactions in Matter

Quantum Mechanics. An essential theory to understand properties of matter and light. Chemical Electronic Magnetic Thermal Optical Etc.

Worksheets for GCSE Mathematics. Quadratics. mr-mathematics.com Maths Resources for Teachers. Algebra

Property Testing and Affine Invariance Part I Madhu Sudan Harvard University

Integrating Rational functions by the Method of Partial fraction Decomposition. Antony L. Foster

Review for Exam Hyunse Yoon, Ph.D. Adjunct Assistant Professor Department of Mechanical Engineering, University of Iowa

ECE531 Lecture 8: Non-Random Parameter Estimation

TECHNICAL NOTE AUTOMATIC GENERATION OF POINT SPRING SUPPORTS BASED ON DEFINED SOIL PROFILES AND COLUMN-FOOTING PROPERTIES

ABSTRACT OF THE DISSERTATION

EECE 2150 Circuits and Signals Final Exam Fall 2016 Dec 12

A linear operator of a new class of multivalent harmonic functions

Specialist Mathematics 2019 v1.2

Haar Basis Wavelets and Morlet Wavelets

Coarse-Graining via the Mori-Zwanzig Formalism

(2) Orbital angular momentum

A Circular Cylinder. 1 Introduction. RITA CHOUDHURY 1 and BIBHASH DEKA 2

Review for Exam Hyunse Yoon, Ph.D. Assistant Research Scientist IIHR-Hydroscience & Engineering University of Iowa

Secondary 3H Unit = 1 = 7. Lesson 3.3 Worksheet. Simplify: Lesson 3.6 Worksheet

Lise Meitner, Otto Hahn. Nuclear Fission Hans-Jürgen Wollersheim

Physics 371 Spring 2017 Prof. Anlage Review

Equation Sheet, Phys 1321 (Exam II), University of Houston, Fall 2016 Instructor: Dr. W. P. Su

ME5286 Robotics Spring 2017 Quiz 2

Module 7 (Lecture 25) RETAINING WALLS

Grover s algorithm. We want to find aa. Search in an unordered database. QC oracle (as usual) Usual trick

CHEMISTRY Spring, 2018 QUIZ 1 February 16, NAME: HANS ZIMMER Score /20

Random Vectors Part A

Detection & Estimation Lecture 1

COMPRESSION FOR QUANTUM POPULATION CODING

Chapter 8: Least squares (beginning of chapter)

Analysis of Survival Data Using Cox Model (Continuous Type)

MATH 1080: Calculus of One Variable II Fall 2018 Textbook: Single Variable Calculus: Early Transcendentals, 7e, by James Stewart.

Kinetic Model Parameter Estimation for Product Stability: Non-uniform Finite Elements and Convexity Analysis

Expectation Propagation performs smooth gradient descent GUILLAUME DEHAENE

Gibbs Sampler Derivation for Latent Dirichlet Allocation (Blei et al., 2003) Lecture Notes Arjun Mukherjee (UH)

Module 6 (Lecture 22) LATERAL EARTH PRESSURE

Effect of First Order Chemical Reaction for Coriolis Force and Dust Particles for Small Reynolds Number in the Atmosphere Over Territory

Transcription:

ECE 6540, Lecture 06 Sufficient Statistics & Complete Statistics Variations

Last Time Minimum Variance Unbiased Estimators Sufficient Statistics Proving t = T(x) is sufficient Neyman-Fischer Factorization 2

Practice Consider yy = xx + ww ww ~ NN 00, ΣΣ What is the distribution of yy yy + 11σσ 2 yy/nn HHHH HHyy yy TT HH yytt 11 NN yy TT xx xx TT ΛΛyy 3

Neyman-Fischer Factorization Let s reconsider the XX 1, XX 2, XX 3, XX NN ~ iid NN μμ, σσ 2 With unknown θθ = μμ σσ 2 pp xx; θθ = = 1 1 exp xx μμ11 2 2ππσσ2 NN/2 2σσ2 1 1 exp 2ππσσ2 NN/2 2σσ 2 xxtt xx 2μμxx TT 11 + NNμμ 2 Are these sufficient statistics for μμ and/or σσ 2? xx TT 11 xx TT xx xx μμ11 2 xx TT xx 2μμxx TT 11 xx TT 11 + 94 xx xx TT 11/NN xx xxtt 11 NN 11 22 xx TT xx 2xx TT 11 xx TT 11 xx TT xx 4

Neyman-Fischer Factorization Let s reconsider the XX 1, XX 2, XX 3, XX NN ~ iid NN μμ, σσ 2 With unknown θθ = μμ σσ 2 pp xx; θθ = = 1 1 exp xx μμ11 2 2ππσσ2 NN/2 2σσ2 1 1 exp 2ππσσ2 NN/2 2σσ 2 xxtt xx 2μμxx TT 11 + NNμμ 2 Are these sufficient statistics for μμ or σσ 2? xx TT 11 xx TT xx xx μμ11 2 xx TT xx 2μμxx TT 11 xx TT xx 2xx TT 11 xx TT 11 xx TT xx xx TT 11 + 94 xx xx TT 11/NN xx xxtt 11 NN 11 22 = xx TT 11/NN xx TT xx 2 xxtt 11 22 NN Contains only xx TT 11 and xx TT xx + xxtt 11 22 5

Last Time Consider the random variables XX 1, XX 2, XX 3, XX NN are iid such that XX ii 1,2,3,, MM Pr XX = ii = pp ii θθ = pp 1 pp 2 pp MM What is this describing? 6

Practice Consider the random variables XX 1, XX 2, XX 3, XX NN are iid such that XX ii 1,2,3,, MM Pr XX = ii = pp ii θθ = pp 1 pp 2 pp MM δδ pp xx; θθ = pp xx 1 δδ 1 pp xx 2 δδ 2 pp xx 3 δδ xx MM 3 pp MM 7

Practice Consider the random variables XX 1, XX 2, XX 3, XX NN are iid such that XX ii 1,2,3,, MM Pr XX = ii = pp ii θθ = pp 1 pp 2 pp MM δδ pp xx; θθ = pp xx 1 δδ 1 pp xx 2 δδ 2 pp xx 3 δδ xx MM 3 pp MM pp xx; θθ = NN nn=1 pp xx; θθ = pp NN nn=1 1 pp 1 δδ xx nn 1 pp2 δδ xx nn 2 pp3 δδ xx nn 3 ppmm δδ xx nn MM δδ xx ii 1 NN nn=1 pp2 δδ xx ii 2 NN nn=1 pp3 δδ xx ii 3 NN nn=1 ppmm δδ xx ii MM gg TT xx = NN nn=1 δδ xx ii 1 A Histogram!!! NN nn=1 δδ xx ii MM ; θθ 8

Practice Beta Distribution Example Let s consider the XX 1, XX 2, XX 3, XX NN ~ iid Beta αα, ββ with unknown ββ p xx ii ; ββ = 1 BB αα,ββ xx ii αα 1 1 xx ii ββ 1 for 0 xx ii 1 Determine a sufficient statistic for ββ See Homework!! Source: https://en.wikipedia.org/wiki/beta_distribution 9

Neyman-Fischer Factorization Note about minimum sufficient statistics Kay s definition is misleading According to Kay: The data set that contains the least number of elements is called the minimal one. More correct definition A sufficient statistic SS(xx) is minimal sufficient if there exists a function ff such that SS(XX) = ff(tt(xx)) for any sufficient statistic TT(XX) That is, if a function of any other sufficient statistic from can make SS(XX), then SS(XX) is a minimal sufficient statistic 10

Using Sufficient Statistics to Find the MVU Estimator

Rao-Blackwell-Lehmann-Scheffe Theorem If θθ is an unbiased estimator of θθ and TT xx is a sufficient statistic for θθ, then θθ = E θθ TT xx is 1) A valid estimator for θθ (not dependent on θθ) 2) Unbiased TT xx represents more information 3) Of lesser or equal variance than that of θθ, for all θθ (each element of θθ has lesser or equal variance) 4) If the sufficient statistic is complete, then θθ is the MVU estimator 12

Rao-Blackwell-Lehmann-Scheffe Theorem If θθ is an unbiased estimator of θθ and TT xx is a sufficient statistic for θθ, then θθ = E θθ TT xx is. Use another Use another suff. statistic suff. statistic TT 1 xx TT 2 xx θθ 1 θθ 2 A smaller variance estimate! θθ 3 A smaller variance estimate! Use another suff. statistic TT 3 xx θθ 4 A smaller variance estimate! θθ Smallest variance estimate! 13

Rao-Blackwell-Lehmann-Scheffe Theorem If θθ is an unbiased estimator of θθ and TT xx is a sufficient statistic for θθ, then θθ = E θθ TT xx is. Use a complete statistic TT 1 xx θθ 1 θθ Smallest variance estimate! 14

Using Complete, Sufficient Statistics to Find the MVU Estimator

A Complete Statistics A statistic is complete if there is only one function gg of the statistic t = TT xx that is unbiased. That is, A statistic is complete if only one gg exists such that EE gg TT xx = θθ 16

Procedure for finding the MVU Estimator (using sufficient statistics) 1. Find a sufficient statistic TT xx using the Neyman-Fisher factorization 2. Show that TT xx is complete (if not, stop) 3. The MVU estimator is 1. θθ = EE θθ TT xx where θθ is any unbiased estimator for θθ Variance Space of all unbiased estimators Space of all sufficient unbiased estimators MVUB estimator In the end though, proving that a statistic TT xx is complete is difficult 17

Example: Consider the signal xx = AA11 + ww ww ~ NN 0, IIσσ 22 Show that tt = TT xx = NN nn=1 xx ii is complete. DC voltage 18

Example: Consider the signal xx = AA11 + ww ww ~ NN 0, IIσσ 22 Show that tt = TT xx = NN nn=1 xx ii is complete. DC voltage Assume the exists a function gg such that E gg tt = AA Assume there exists a second function h such that E h tt = AA Then E gg tt h tt = AA AA = 0 Let vv tt = gg tt h tt Then E vv tt = vv tt pp tt; θθ ddtt = 0 19

Example: Consider the signal xx = AA11 + ww ww ~ NN 0, IIσσ 22 Show that tt = TT xx = NN nn=1 xx ii is complete. DC voltage Then E vv tt = vv tt pp tt; θθ ddtt = 0 tt ~ NN NNNN, NNσσ 22, so E vv tt = vv tt 1 2 ππππ σσ 2 exp tt NNNN 2 2NNσσ 2 dddd = 0 If ττ = tt, E vv tt = NN vv ττ NN 2 ππππ σσ NN exp AA ττ 2 ddττ = 0 2 2σσ 2 vvv ττ ff AA ττ 20

Example: Consider the signal xx = AA11 + ww ww ~ NN 0, IIσσ 22 Show that tt = TT xx = NN nn=1 xx ii is complete. DC voltage If ττ = tt, E vv tt = NN vv ττ vvv ττ NN 2 ππππ σσ NN exp AA ττ 2 ddττ = 0 2 2σσ 2 ff AA ττ E vv tt = vv ττ ff AA ττ dddd = vv ττ ff ττ = 0 VV ωω FF ωω = 0 21

Example: Consider the signal xx = AA11 + ww ww ~ NN 0, IIσσ 22 Show that tt = TT xx = NN nn=1 xx ii is complete. DC voltage VV ωω FF ωω = 0 Fourier Transform of a Gaussian is Gaussian, so not zero Therefore, VV ωω = 0 vv ττ = 0 vv tt = 0 gg tt h tt = 0 Hence gg tt = h tt -> There can only be one unbiased function! TT xx is a complete statistic! 22

A Complete Statistics A statistic is complete if there is only one function gg of the statistic t = TT xx that is unbiased. That is, A statistic is complete if only one gg exists such that EE gg TT xx = θθ Alternatively, A statistic is complete if vv tt pp tt; θθ ddtt = EE vv TT xx = 0, for all θθ Is satisfied only when vv tt = 0 for all tt = TT xx. 23

There is one simplification Consider the Fischer Factorization: pp xx; θθ = gg tt; θθ h xx If gg tt; θθ has the form gg tt; θθ = cc θθ exp aa kk θθ tt kk KK kk=1 Then pp xx; θθ and pp tt; θθ are said to be in the exponential family And tt = TT xx is a complete, sufficient statistic Many common PDFs satisfy this. 24

Example: Are the following distributions in the exponential family? Exponential: pp xx; λλ = λλee λλλλ, xx 0 0, xx < 0 Gamma: pp xx; kk, θθ = xkk 1 ee xx θθ θθ kk Γ kk Gaussian: pp xx; μμ, σσ 2 = 2ππσσ 1 1 2 1/2 exp 2 xx μμ 2 2σσ 25

Example Let XX 1, XX 2, XX 3, XX NN ~ iid Bernoulli θθ Determine a complete, sufficient statistics for θθ Determine an MVUB Estimator 26

Example Let XX 1, XX 2, XX 3, XX NN ~ iid Bernoulli θθ NN pp xx; θθ = ii=1 θθ xx ii θθ 1 1 xx ii = θθ NN ii=1 xxii 1 θθ NN NN ii=1 xxii tt tt = 1 1 θθ NN θθ 1 θθ h xx Hence, NN ii=1 gg ii=1 NN TT xx = xx ii ; θθ NN xxii = 1 1 θθ NN exp ii=1 xx ii is a complete sufficient statistic NN ii=1 xx ii gg tt; θθ = cc θθ exp log θθ 1 θθ KK aa kk θθ tt kk kk=1 27

Example Let XX 1, XX 2, XX 3, XX NN ~ iid Bernoulli θθ The expected value of the sufficient statistic is NN EE tt = EE ii=1 xx ii NN = EE xx ii ii=1 = NNθθ Therefore tt is an unbiased estimator. Since t is complete, it is also MVUB. So the NN MVUB estimator is NN θθ = ii=1 xx ii 28

Example Let XX 1, XX 2, XX 3, XX NN ~ iid NN θθ, 1 Determine a complete, sufficient statistic for θθ 29

Example Let XX 1, XX 2, XX 3, XX NN ~ iid NN θθ, 1 pp xx; θθ = 1 2ππ NN/2 exp 1 2 xx θθ11 2 = 1 2ππ NN/2 exp 1 2 xxtt xx 2θθxx TT 11 + NNθθ 2 = 1 2ππ NN/2 exp 1 2 2θθxxTT 11 + NNθθ 2 exp xxtt xx 2 = 1 2ππ NN/2 exp 1 2 2θθθθ xx + NNθθ2 exp xxtt xx 2 gg TT xx = xx TT 11; θθ h xx = cc θθ exp aa kk θθ tt kk KK kk=1 30

Example Let XX 1, XX 2, XX 3, XX NN ~ iid NN θθ, 1 The unbiased complete, sufficient statistic is xx TT 11/NN Hence the MVUB estimator is θθ = xx TT 11/NN 31

Example Let XX 1, XX 2, XX 3, XX NN ~ iid NN θθ, σσ 2 with unknown θθ, σσ 2 Determine a complete, sufficient statistic for θθ and σσ 2 32

Example Let XX 1, XX 2, XX 3, XX NN ~ iid NN θθ, σσ 2 with unknown θθ, σσ 2 pp xx; θθ = = 1 1 exp xx θθ11 2 2ππσσ2 NN/2 2σσ2 1 1 exp 2ππσσ2 NN/2 2σσ 2 xxtt xx 2θθxx TT 11 + NNθθ 2 = 1 2ππσσ 2 NN 2 exp 1 2σσ 2 TT 2 xx 2θθTT 1 xx + NNθθ 2 1 gg tt = xxtt 11 xx TT xx ; θθ σσ 2 h xx gg tt; θθ = cc θθ exp KK aa kk θθ tt kk kk=1 33

Example Let XX 1, XX 2, XX 3, XX NN ~ iid NN θθ, σσ 2 with unknown θθ, σσ 2 The unbiased complete, sufficient statistic for θθ is xx TT 11/NN The unbiased complete, sufficient statistic for σσ 2 is xx xxtt 11 NN 11 22 Hence the MVUB estimators are θθ = xx TT 11/NN σσ 2 = xx xxtt 11 NN 11 22 34

Beta Distribution Example Let s consider the XX 1, XX 2, XX 3, XX NN ~ iid Beta αα, ββ with unknown ββ p xx ii ; ββ = 1 BB αα,ββ xx ii αα 1 1 xx ii ββ 1 for 0 xx ii 1 Determine a complete, sufficient statistic for ββ See Homework!!! 35

Consider XX 1, XX 2, XX 3, XX NN ~ iid exponential λλ The statistic NN tt = nn=1 Is a complete, sufficient statistic for λλ. What is the MVUB estimator? xx nn 36

Consider XX 1, XX 2, XX 3, XX NN ~ iid exponential λλ The statistic NN tt = nn=1 Is a complete, sufficient statistic for λλ. What is the MVUB estimator? tt~erlang NN, λλ or Gamma NN, 1 λλ xx nn EE tt = 1 λλ 1 tt ~ InvGamma NN, λλ EE 1 = λλ = λλ 1 tt NN 1 NN 1 So the MVUB estimator is λλ = NN 1 1 NN nn=1 xx nn 37

Procedure for finding the MVU Estimator (using sufficient statistics) For xx = AA11 + ww, ww ~ NN 0, IIσσ 22 1. Find a sufficient statistic TT xx using the Neyman-Fisher factorization 2. Show that TT xx is complete (if not, stop) 3. The MVU estimator is 1. θθ = EE θθ TT xx where θθ is any unbiased estimator for θθ

In the end, Applying Neyman-Fisher factorization can be difficult Proving that a statistic is complete can be difficult (unless you have an exponential family PDF) Applying Rao-Blackwell-Lehmann-Scheffe Theorem can be difficult What else can we do???? 39