Idea is to sample from a different distribution that picks points in important regions of the sample space. Want ( ) ( ) ( ) E f X = f x g x dx

Similar documents
Lecture 3. Sampling, sampling distributions, and parameter estimation

Econometric Methods. Review of Estimation

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STK4011 and STK9011 Autumn 2016

Simulation Output Analysis

( ) ( ) ( ) f ( ) ( )

Chapter 5 Properties of a Random Sample

Summary of the lecture in Biostatistics

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Chapter 8: Statistical Analysis of Simulated Data

Multiple Linear Regression Analysis

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

X ε ) = 0, or equivalently, lim

Lecture 9: Tolerant Testing

General Method for Calculating Chemical Equilibrium Composition

Chapter -2 Simple Random Sampling

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes

Module 7. Lecture 7: Statistical parameter estimation

Chapter -2 Simple Random Sampling

2SLS Estimates ECON In this case, begin with the assumption that E[ i

Sampling Theory MODULE X LECTURE - 35 TWO STAGE SAMPLING (SUB SAMPLING)

9 U-STATISTICS. Eh =(m!) 1 Eh(X (1),..., X (m ) ) i.i.d

Lecture Notes Forecasting the process of estimating or predicting unknown situations

Sampling Theory MODULE V LECTURE - 14 RATIO AND PRODUCT METHODS OF ESTIMATION

Lecture 02: Bounding tail distributions of a random variable

CHAPTER VI Statistical Analysis of Experimental Data

Chapter 4 Multiple Random Variables

Bayesian Inferences for Two Parameter Weibull Distribution Kipkoech W. Cheruiyot 1, Abel Ouko 2, Emily Kirimi 3

Multivariate Transformation of Variables and Maximum Likelihood Estimation

BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL DISTRIBUTION

LECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR

Lecture 3 Probability review (cont d)

Special Instructions / Useful Data

PROPERTIES OF GOOD ESTIMATORS

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

22 Nonparametric Methods.

Recall MLR 5 Homskedasticity error u has the same variance given any values of the explanatory variables Var(u x1,...,xk) = 2 or E(UU ) = 2 I

Reliability evaluation of distribution network based on improved non. sequential Monte Carlo method

Functions of Random Variables

Outline. Point Pattern Analysis Part I. Revisit IRP/CSR

Chapter 14 Logistic Regression Models

Discrete Mathematics and Probability Theory Fall 2016 Seshia and Walrand DIS 10b

Chapter 4 (Part 1): Non-Parametric Classification (Sections ) Pattern Classification 4.3) Announcements

Parameter, Statistic and Random Samples

Lecture 2: The Simple Regression Model

Chapter 3 Sampling For Proportions and Percentages

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

Simple Linear Regression

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

Point Estimation: definition of estimators

Lecture 8: Linear Regression

Chapter 8. Inferences about More Than Two Population Central Values

to the estimation of total sensitivity indices

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

Simple Linear Regression

Econometrics. 3) Statistical properties of the OLS estimator

ρ < 1 be five real numbers. The

CS286.2 Lecture 4: Dinur s Proof of the PCP Theorem

Random Variables and Probability Distributions

Lecture Notes Types of economic variables

1 Solution to Problem 6.40

STATISTICAL INFERENCE

THE ROYAL STATISTICAL SOCIETY 2009 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 2 STATISTICAL INFERENCE

The Mathematical Appendix

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

Sample Allocation under a Population Model and Stratified Inclusion Probability Proportionate to Size Sampling

ENGI 4421 Propagation of Error Page 8-01

TESTS BASED ON MAXIMUM LIKELIHOOD

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

The TDT. (Transmission Disequilibrium Test) (Qualitative and quantitative traits) D M D 1 M 1 D 2 M 2 M 2D1 M 1

We have already referred to a certain reaction, which takes place at high temperature after rich combustion.

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

SPECIAL CONSIDERATIONS FOR VOLUMETRIC Z-TEST FOR PROPORTIONS

ESS Line Fitting

Lecture Note to Rice Chapter 8

Third handout: On the Gini Index

Dimensionality Reduction and Learning

MEASURES OF DISPERSION

Multiple Choice Test. Chapter Adequacy of Models for Regression

Bounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy

Introduction to local (nonparametric) density estimation. methods

Handout #8. X\Y f(x) 0 1/16 1/ / /16 3/ / /16 3/16 0 3/ /16 1/16 1/8 g(y) 1/16 1/4 3/8 1/4 1/16 1

COV. Violation of constant variance of ε i s but they are still independent. The error term (ε) is said to be heteroscedastic.

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

: At least two means differ SST

ENGI 3423 Simple Linear Regression Page 12-01

8.1 Hashing Algorithms

Transcription:

Importace Samplg Used for a umber of purposes: Varace reducto Allows for dffcult dstrbutos to be sampled from. Sestvty aalyss Reusg samples to reduce computatoal burde. Idea s to sample from a dfferet dstrbuto tat pcks pots mportat regos of te sample space. Wat E f X f x g x dx Istead of samplg from desty (or probablty g x, sample from a mass fucto) dstrbuto wt desty (or pmf) ( x ). Sce we are samplg from te wrog dstrbuto we ave to make adjustmets our estmator.

g( x) ( x) g( X) E f ( X) ( X ) Eg f X f x g x dx f x x dx Ts suggests te followg estmato sceme ) Sample x,, x ) Calculate wegts 3) Use estmator from ˆ µ w x. ( ) g x x wf ( x) f, IS So stead of a regular average, ts estmator s a wegted average. So pots tat occur more ofte uder ( x ) ta g( x ) get dowwegted ad tose tat occur less ofte get upwegted.

Notce tat ˆ µ f, IS s a ubased estmate of Eg f ( X) regardless of wc proposal dstrbuto ( x ) as log as support as g( x ),.e. x as te same g( x ) > 0 mples tat ( x ) > 0 Note tat ( x ) > 0 ca be allowed to occur we g( x ) 0, toug dog ts teds to be effcet (but tere are tmes you wat to do ts). 3

Sce ˆ µ f, IS s ubased, te ma dea s to pck a dstrbuto ( x ) tat reduces te varace. ( X) ( X) f X g X f X g X Var E µ f To do ts, we wat ( x ) to look lke f ( x) g( x ),.e. make look lke a costat. f ( x) g( x) ( x) Te optmal ( x ) satsfes x f x g x f x g x dx Note tat ts usually ca t be determed, due to te ormalzg costat. However ts does gve us a motvato for pckg ( x ). 4

Example: Mote Carlo Evaluato of a Lkelood Rato (Geetcs Example) Assume tat you ave a mssg data model were X ( Xobs, Xms ). Te te observed data lkelood rato satstfes l ( θ, θ ) 0 ( θ ) ( θ ) 0 pθ ( X ) obs θ 0 obs ( obs, ms ) (, ) L L p X E p X X θ X θ 0 obs pθ X 0 obs Xms Ts ca be estmated by samplg z, p X X calculatg from ( ms obs ) ) f ( z ) 0 ( obs, ) (, ) p X z θ p X z θ obs ) lˆ ( θ, θ ) f ( z ) 0, z Suppose tat you are terested gettg l θ, θ, based o ts Mote Carlo estmate. 0 5

Ts ca be doe wt te mportace samplg estmate ( obs, ) (, ) p X z lˆ ( θ, θ0) f ( z ) p X z θ θ obs Ts ca be sow to be a ubased estmator l θ, θ. of 0 Geetcs example: Observed Data Model λ λ λ + λ,,, ~Mult 97,,,, 4 4 4 4 ( Y Y Y Y ) 3 4 ( λ ) gy Y Y + Y Y λ λ + λ 4 4 4 3 4 Complete Data Model ( X, X, X, X, X ) 3 4 5 λ λ λ λ ~ Mult 97,,,,, 4 4 4 4 6

( λ ) g X X + X X + X X λ λ 4 4 4 3 5 λ As see before X4 Y4 ~B Y4, + λ Te complete data lkelood rato satsfes (, λ0 ) Y + X Y + Y λ0 λ0 gyx, λ λ λ g Y X 4 3 Note tat ts mples te mportace sample wegt satsfes w I ts case ˆ (, ) θ (,,, θ, θ ) c Y Y Y3 θ l λ λ as te form 0 (,,,, ) ˆ c Y Y Y3 λ λ λ l ( λ, λ0) f ( z ) λ z z 7

As may problems, te desred samplg dstrbuto does t eed to be kow exactly, but oly up to te ormalzg costat (.e. l x cg x ). Importace samplg stll works fe ts case. ) Sample x,, x ) Calculate wegts 3) Use estmator x. from ˆ µ w ( ) l x x ( x) f, IS W w wf wf ( x) were W w. 8

Propertes of ts estmator ( x) cg( x) dx l x E w( X) f ( X) f x x dx x f ce f X g l x E [ w] ( x) dx x cg x dx c As ts s a rato estmator, t s o loger ubased, but t s cosstet. I addto, we c s kow, ts estmator s ofte preferred to te ubased oe dscussed last tme as t ofte as a smaller mea square error (to be dscussed later). Effcecy of mportace samplg Effectve sample sze ESS ( ) + var ( w( X) ) 9

Sce te wegts are usually oly kow up to te ormalzg costat, var ( w( X )) eeds to be estmated by te coeffcet of varato of te uormalzed wegts cv ( w) ( w w) ( ) w Assume for wat follows tat c (ad ts s kow). We ave two estmators µ wf ( x) (ubased) µ ˆ µ wf ( x) w w (rato) Let Z w( X) f ( X). Te by te delta metod ( ) [ ˆ µ ] ( ) + ( ) E E Z w w cov, Var µ + ( wz) µ ( W) 0

I addto, Var ( ˆ µ ) + ( µ Var ( w) Var ( Z) µ Cov ( w, Z) ) For te ubased estmator, [ ] ad Var ( µ ) Var E µ µ Tus MSE ad E [ ] ( µ ) ( ) µ Z Var ( Z ) MSE ˆ µ ˆ µ µ + Var ˆ µ MSE + + ( µ Var ( w) µ Cov ( w, Z) ) O( ) Tus te rato estmate s to be preferred we ( wz) > ( w) µ Cov, µ Var

(assumg µ > 0). Tat s we w( X ) ad w( X) f ( X ) are gly correlated. I addto, te formula for Var ( ˆ) (wole buc of steps omtted) ( ) µ mples ( ) Var µ Var f X + Var w X g Rearragg ts gves ( f ( X) ) Varg Var + Var ( w X f X ) ( w( X) ) Oe way of tkg of ts statemet s tat x s samples from te proposal dstrbuto wort ( Var ( w( X) )) te target dstrbuto g( x ). + samples draw from Te ce tg wt ts rule of tumb s tat t does t deped o te fucto beg tegrated.

Tus te effectve sample sze s a useful measure of te effcecy of te metod we dfferet fuctos f are vestgated wt a sgle sample. Oe cosequece of ts s tat we wat to keep te coeffcet of varato of te wegts well beaved. Oe way of dog ts s to ave te proposal x be eaver taled ta te dstrbuto target dstrbuto g( x ). Ts wll elp mmze E g( X) g( x) ( X) ( x) g X Eg X x dx wc wll keep Var g X X small. For example, use a t dstrbuto wt moderate degrees of freedom stead of a ormal. 3

Margalzato mportace samplg Let g( X, X ) ad (, ) X X be two probablty destes were te support of g s a subset of g X, X > 0 mples te support of (e.g. ( ) ( X, X ) > 0). Te Var (, ) (, ) were g( X ) ad ( ) g X X g X Var X X X X are te respectve margal dstrbutos. E (, X ) X (, X) (, ) ( X, X) g( X, X ) g X X g X X X X dx ( ) ( ) X X X g X g( X X ) dx X X X dx g X X 4

Tus Var ( ) g X, X g X, X Var E X X, X X, X g X Var X I addto Var (, ) (, ) g X X g X Var X X X (, X) (, X ) g X E Var X X Te mplcato of ts result s tat were possble, mmze te umber of varables you sample as ts wll crease ESS( ). However te computatoal burde must also be cosdered. 5

For example, f Var (, ) (, ) g X X g X Var X X X but te computatoal tme volved samplg ( X ) s 4 tmes te tme volved samplg ( X, X ), samplg over te secod space s to be preferred. Smlarly, Rao-Blackwellzato ca be wt mportace samplg, gvg a estmate of te form µ W (, ) we f X X x g, Wle ts s a cosstet estmate, t may ot ave a smaller varace ta te o-rao- Blackwellzed estmate. However may stuatos t sould. Te followg s a example were t dd elp. 6

Estmatg recombato fractos wt pedgree data 4 members 7 (ofouders), f 4 (fouders) 8 markers from cromosome 9 #alleles rages from 6 to 8 4 members top geerato ave o marker data Wat to use pedgree to estmate te dstaces betwee te 8 markers 7

A D A D No Recombato Recombato A D A D No Recombato Recombato Ca use te recombato fracto (P[recombato]) as a measure of dstace. 8

Data Structure x l data o locus l x,..., M x x m m markers o o x D dsease / trat data o x ( x x x ),...,, m y l aplotype for locus l o y ( y y y ),...,, m D D o Allele formato wt paretal source z l ertace vector for locus l o z ( z z z ),...,, m D o Idcators of weter allele erted came from te gradmoter or gradfater Assume tat z l s a vector of legt. Te a estmate of θ j, te recombato fracto betwee markers j ad j + s (, z, ) θ j I zj j+ assumg tat z s kow. 9

Ufortuately, for most data sets, z ca t be determed wt certaty. However t s possble to do smulato ad estmate te recombato fractos usg Mote Carlo EM (MCEM) As part of ts procedure s t ecessary to calculate E zl x, θ va Mote Carlo. Tere are two approaces to dog ts. ) Sample samplg ( ) y from p θ yx by mportace Sample ( ) z from ( (), ) p θ zxy Estmate E zl x, θ by ) Sample samplg m w l W z ( ) y from p θ Estmate E zl x, θ by m () () we l, W yx by mportace z y θ 0

Bot approaces estmate exactly te same quatty, but te d approac s muc more effcet. Frst te secod approac as a smaller varace. I addto, calculatg te expected value s actually faster ta samplg te z s. (usually does t work out ts way).