Some Examples on Gibbs Sampling and Metropolis-Hastings methods

Size: px
Start display at page:

Download "Some Examples on Gibbs Sampling and Metropolis-Hastings methods"

Transcription

1 Soe Exaples o Gibbs Saplig ad Metropolis-Hastigs ethods S420/620 Itroductio to Statistical Theory, Fall 2012

2 Gibbs Sapler Saple a ultidiesioal probability distributio fro coditioal desities. Suppose d = 2, θ = (θ 1, θ 2 ). Set a iitial poit, θ (0) = (θ (0) 1, θ(0) 2 ). 1 Geerate a value θ (1) 2 fro π(θ 2 θ 1 = θ (0) 1, X = x). 2 Geerate a value θ (1) 1 fro π(θ 1 θ 2 = θ (1) 2, X = x). Two steps give a ew value θ (1) = (θ (0) 1, θ(0) 2 ) After several iteratios: θ (1), θ (2), θ (3),..., Saples (after traiig period) correspod to π(θ 1, θ 2 x). S420/620 Itroductio to Statistical Theory, Fall 2012

3 Exaple: Bioial-Beta Oe observatio, X Bioial(, θ) ad π(θ) = Beta(a, b) Bayes theore, π(θ x) θ x (1 θ) x θ a 1 (1 θ) b 1 The joit desity for (x, θ), ( ) f (x; θ)π(θ) = θ x x Γ(a + b) (1 θ) x Γ(a)Γ(b) θa 1 (1 θ) b 1 What is the argial distributio of X? Its a Bioial-Beta distributio, ( ) Γ(a + b) Γ(a + x)γ(b + x) f (x) =, x = 0, 1, 2,..., x Γ(a)Γ(b) Γ(a + b + ) S420/620 Itroductio to Statistical Theory, Fall 2012

4 Via Gibbs saplig. Set iitial value (x (0), θ (0) ). For i = 1, 2,... M 1 Saple x (i) f (x θ (i 1) ) = Bioial(, θ (i 1) ) 2 Saple θ (i) π(θ x (i) ) = Beta(a + x (i), b + x (i) ) 3 Repeat (1) ad (2) ay ties. The process produces saples of (x, θ). Approxiate f (x) with the values x (i) For exaple, f (x) = P(X = x) (# of X (i) = x)/m We ay also, (1) Saple θ (i) π(θ x (i 1) ) (2) Saple x (i) f (x θ (i) ). S420/620 Itroductio to Statistical Theory, Fall 2012

5 # Exaple 1:Beta-Bioial siulatio # Prior paraeters a=2; b=4; =16 # prepare objects to save iteratios it =1500 x= rep(na,it); th=rep(na,it) # set iitial value x[1]=1; th[1]=0.5 # Perfor Gibbs iteratios for (i i 2:it) { x[i] =rbio(1,size=,prob=th[i-1]) th[i] = rbeta(1,a+x[i],b+-x[i]) } S420/620 Itroductio to Statistical Theory, Fall 2012

6 Trace plots for siulatios of x ad θ x iteratio th iteratio S420/620 Itroductio to Statistical Theory, Fall 2012

7 Histogra x values (f(x)) Desity x S420/620 Itroductio to Statistical Theory, Fall 2012

8 Histogra θ values (π(θ) ) Desity theta S420/620 Itroductio to Statistical Theory, Fall 2012

9 I d-diesios, we iteratively use π i (θ i θ i, x) where θ i = (θ 1, θ 2,..., θ i1, θ i+1,..., θ d ). Gibbs saplig is a exaple of Markov Chai Mote Carlo (MCMC) ethods. A ew poit θ (s+1) depeds o θ (s) Mote Carlo: pseudo-rado values. The liit or statioary distributio is π(θ x). Gibbs saplig assues direct siulatio fro full coditioals. If full coditioal siulatio is ot possible, the we could use Metropolis-Hastigs. S420/620 Itroductio to Statistical Theory, Fall 2012

10 Exaple: Bioial-Beta-Poisso Treat as ukow with π() = Poisso(λ), (λ kow). X Bioial(, θ); π(θ) = Beta(a, b) The joit distributio for (X, θ, ) is: ( ) ) θ x x Γ(a + b) (1 θ) x Γ(a)Γ(b) θa 1 (1 θ) (e b 1 λ λ! x = 0, 1,..., ; 0 < θ < 1; = 0, 1, 2,.... Agai, iterested o f (x) but ipossible to fid i aalytic-closed for. Alteratively (get rid of costats), f (θ, x, ) ( x )θ a+x 1 (1 θ) b+ x 1 λ! S420/620 Itroductio to Statistical Theory, Fall 2012

11 For Gibbs Saplig, fid full coditioals: f (x θ, ) ( x) θ x (1 θ) x Bioial(, θ), π(θ x, ) θ a+x 1 (1 θ) b+ x 1 Beta(a + x, b + x), ) λ π( θ, x) ( x! (1 θ) x [λ(1 θ)] x ( x)! ; = x, x + 1, x + 2,..., If we set, z = x, the z Poisso(λ(1 θ)) Set (x (0), θ (0), (0) ). For i = 1, 2, 3,..., Saple x (i) Bioial( (i 1), θ (i 1) ). Saple θ (i) Beta(a + x (i), b + (i 1) x (i) ). Saple (i) = x (i) + z, z Poisso(λ(1 θ (i) )) Repeat util covergece is reached. S420/620 Itroductio to Statistical Theory, Fall 2012

12 Trace plot ad histogra of values iteratio Histogra of Desity S420/620 Itroductio to Statistical Theory, Fall 2012

13 Histogra of x values Desity x S420/620 Itroductio to Statistical Theory, Fall 2012

14 Exaple with Bivariate Noral Distributio x = (x 1, x 2 ), µ = (µ 1, µ 2 ) ad Σ is a 2 2 covariace atrix with diagoal etries σ1 2, σ2 2 ad off-diagoals σ 1,2. The pdf is ( f (x µ, Σ) Σ 1/2 exp 1 ) 2 (x µ)t Σ 1 (x µ) Margial distributios: x 1 N(µ 1, σ 2 1 ) ad x 2 N(µ 2, σ 2 2 ). For Gibbs saplig, we eed f (x 1 x 2 ) ad f (x 2 x 1 ). I fact f (x 1 x 2 ) = N(µ 1 + (σ 1,2 /σ 2 2 )(x 2 µ 2 ), σ 2 1 (σ 1,2/σ 2 ) 2 ) f (x 2 x 1 ) = N(µ 2 + (σ 1,2 /σ 2 1 )(x 1 µ 1 ), σ 2 2 (σ 1,2/σ 1 ) 2 ) Exaple with µ = (2, 1), σ 2 1 = σ2 2 = 1 ad σ 1,2 = 0.7. S420/620 Itroductio to Statistical Theory, Fall 2012

15 Trace plots for siulatios of x 1 ad x 2 xs[, 1] Idex xs[, 2] Idex S420/620 Itroductio to Statistical Theory, Fall 2012

16 Histogras for siulatios of x 1 ad x 2 Histogra of xs[-(1:100), 1] Desity x1 Histogra of xs[-(1:100), 2] Desity x2 S420/620 Itroductio to Statistical Theory, Fall 2012

17 British coal iig disasters data o. of disasters tie Histogra of y Desity y S420/620 Itroductio to Statistical Theory, Fall 2012

18 Exaple: Poisso process with a Chage Poit Hierarchical odel proposed i Carli, Gelfad ad Sith (1992) (also see Sectio 3.10). First stage: X i Poisso(µ), i = 1, 2,..., k. X i Poisso(λ), i = k + 1, k + 2,..., so X i represets the observatios ad the paraeter of iterest are (µ, λ, k). Data cosists of coal-iig disasters i the U.K. Secod stage: Idepedet priors o (µ, λ, k). k discrete uifor o {1, 2,..., }, =saple size. µ Ga(a 1, b 1 ) ad λ Ga(a 2, b 2 ) a 1, b 1, a 2, b 2 are fixed. S420/620 Itroductio to Statistical Theory, Fall 2012

19 The joit posterior distributio for (µ, λ, k) has the for: π(µ, λ, k X) f (X µ, λ, k)π(µ)π(λ)π(k) The likelihood fuctio is: f (X θ, λ, k) = = k f (x i µ, k) f (x i λ, k) i=1 k µ x i e µ i=1 x i! i=k+1 i=k+1 λ x i e λ x i! Therefore, π(µ, λ, k X) k µ x i e µ i=1 x i! i=k+1 λ x i e λ x i! ( ) (µ a1 1 e µb 1 )(λ a2 1 e λb 2 1 ) S420/620 Itroductio to Statistical Theory, Fall 2012

20 Or equivaletly, π(µ, λ, k X) µ a 1+ k i=1 x i 1 e µ(k+b 1) λ a 2+ i=k+1 x i 1 e λ( k+b 2) Full coditioal distributios, 1 π(µ λ, k, X) µ a 1+ k i=1 x i 1 e µ(k+b1) = Ga(a 1 + k i=1 x i, k + b 1 ) 2 π(λ µ, k, X) λ a 2+ i=k+1 x i 1 e λ( k+b2) = Ga(a 2 + i=k+1 x i, k + b 2 ) 3 π(k µ, λ, X) µ k i=1 x i λ i=k+1 x i e k(µ λ) L(X µ, λ, k) So, p(k µ, λ, X) = L(X µ,λ,k) k=1, k = 1, 2,..., L(X µ,λ,k) Soe results with the British Coal iig disaster data follow. S420/620 Itroductio to Statistical Theory, Fall 2012

21 k Series draws[, i] Histogra of draws[, i] theta ACF Desity iteratio Lag theta Series draws[, i] Histogra of draws[, i] labda ACF Desity iteratio Lag labda Series draws[, i] Histogra of draws[, i] ACF Desity iteratio Lag k S420/620 Itroductio to Statistical Theory, Fall 2012

22 Metropolis-Hastigs Algorith (cotiuous case) Saple (idirectly) a value X fro a pdf f (x). I Bayesia probles f (x) is the posterior distributio. Need a trial or proposal desity q(x, y) 0. 1 Set a iitial value X (0). For = 1, 2,..., 2 Give X () = x, geerate Y = y, fro q(x, y). 3 Copute, { α = i 1, } f (y)q(x, y) f (x)q(x, y) 4 U U(0, 1). If U α, the X (+1) = y. Otherwise, X (+1) = x 5 Chage to + 1 ad go back to 2. S420/620 Itroductio to Statistical Theory, Fall 2012

23 Coo to ru algorith say 1000 iteratios ad delete the first, say 100 (bur-i). Uder a syetric proposal (Metropolis), q(x, y) = q(y, x) ad { α = i 1, f (y) } f (x) Rado walk proposal, h is a scalig paraeter. y = x + z; z N(0, h 2 ). h is specified so that 40% 60% acceptace rate is achieved. S420/620 Itroductio to Statistical Theory, Fall 2012

24 Exaple with Bivariate Noral Distributio θ = (θ 1, θ 2 ), µ = (µ 1, µ 2 ) ad S is a 2 2 covariace atrix with diagoal etries s 1,1, s 2,2 ad off-diagoals s 1,2 = s 2,1. The pdf is ( π(θ) S 1/2 exp 1 ) 2 (θ µ)t S 1 (θ µ) Margial distributios: θ 1 N(µ 1, s 1,1 ) ad θ 2 N(µ 2, s 2,2 ). Exaple with µ = (2, 1), s 1,1 = s 2,2 = 1 ad s 1,2 = 0.7. S420/620 Itroductio to Statistical Theory, Fall 2012

25 Trial or cadidate desity q(θ, θ ), also a bivariate oral so θ = θ + z; z N(0, Σ) with Σ a diagoal atrix with diagoal etries 0.6, 0.4. z 1 N(0, 0.6) ad z 2 N(0, 0.4). Rado walk proposal so q(θ, θ ) = q(θ, θ). The Metropolis ratio is: { } α(θ, θ ) = i 1, π(θ ) = π(θ) ( ) exp 1 = i 1, 2 (θ µ) t S 1 (θ µ) exp ( 1 2 (θ µ)t S 1 (θ µ) ) S420/620 Itroductio to Statistical Theory, Fall 2012

26 Fro a iitial poit θ (0). For i = 1, 2,..., θ = θ (i 1) + z The copute the acceptace probability, α(θ (i 1), θ ). Make θ (i) = θ if U < α(θ (i 1), θ ). Otherwise, θ (i) = θ (i 1). Fro θ (0) = ( 1, 3), we ll look at: Tie series plots of θ = (θ 1, θ 2 ) values. Histogras of argial saples for θ 1 ad θ 2. Bivariate plots of saples. S420/620 Itroductio to Statistical Theory, Fall 2012

27 theta iteratio theta iteratio S420/620 Itroductio to Statistical Theory, Fall 2012

28 Margial for Theta theta1 Margial for Theta theta2 S420/620 Itroductio to Statistical Theory, Fall 2012

29 2 0-2 theta siulatios fr o the Bivariate Noral theta theta siulatios fro the M-H algorith theta 1 S420/620 Itroductio to Statistical Theory, Fall

Statistics for Applications Fall Problem Set 7

Statistics for Applications Fall Problem Set 7 18.650. Statistics for Applicatios Fall 016. Proble Set 7 Due Friday, Oct. 8 at 1 oo Proble 1 QQ-plots Recall that the Laplace distributio with paraeter λ > 0 is the cotiuous probaλ bility easure with

More information

Define a Markov chain on {1,..., 6} with transition probability matrix P =

Define a Markov chain on {1,..., 6} with transition probability matrix P = Pla Group Work 0. The title says it all Next Tie: MCMC ad Geeral-state Markov Chais Midter Exa: Tuesday 8 March i class Hoework 4 due Thursday Uless otherwise oted, let X be a irreducible, aperiodic Markov

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

Queueing Theory II. Summary. M/M/1 Output process Networks of Queue Method of Stages. General Distributions

Queueing Theory II. Summary. M/M/1 Output process Networks of Queue Method of Stages. General Distributions Queueig Theory II Suary M/M/1 Output process Networks of Queue Method of Stages Erlag Distributio Hyperexpoetial Distributio Geeral Distributios Ebedded Markov Chais 1 M/M/1 Output Process Burke s Theore:

More information

Contents Two Sample t Tests Two Sample t Tests

Contents Two Sample t Tests Two Sample t Tests Cotets 3.5.3 Two Saple t Tests................................... 3.5.3 Two Saple t Tests Setup: Two Saples We ow focus o a sceario where we have two idepedet saples fro possibly differet populatios. Our

More information

APPLIED MULTIVARIATE ANALYSIS

APPLIED MULTIVARIATE ANALYSIS ALIED MULTIVARIATE ANALYSIS FREQUENTLY ASKED QUESTIONS AMIT MITRA & SHARMISHTHA MITRA DEARTMENT OF MATHEMATICS & STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANUR X = X X X [] The variace covariace atrix

More information

Modelling Missing Data. Missing Data Mechanism. Problem: Some data Y ij may be missing. Complete-data model:

Modelling Missing Data. Missing Data Mechanism. Problem: Some data Y ij may be missing. Complete-data model: Coplete-data odel: Modellig Missig Data idepedet ad idetically distributed () draws Y,, Y fro ultivariate distributio P θ, Y i = (Y i,, Y ip ) T P θ Data atrix: Y = (Y,, Y ) T = (Y ij ),,;j=,,p Uits Y

More information

Lecture 11 and 12: Basic estimation theory

Lecture 11 and 12: Basic estimation theory Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis

More information

Stanford Statistics 311/Electrical Engineering 377

Stanford Statistics 311/Electrical Engineering 377 I. Uiversal predictio ad codig a. Gae: sequecex ofdata, adwattopredict(orcode)aswellasifwekew distributio of data b. Two versios: probabilistic ad adversarial. I either case, let p ad q be desities or

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Surveying the Variance Reduction Methods

Surveying the Variance Reduction Methods Available olie at www.scizer.co Austria Joural of Matheatics ad Statistics, Vol 1, Issue 1, (2017): 10-15 ISSN 0000-0000 Surveyig the Variace Reductio Methods Arash Mirtorabi *1, Gholahossei Gholai 2 1.

More information

On Modeling On Minimum Description Length Modeling. M-closed

On Modeling On Minimum Description Length Modeling. M-closed O Modelig O Miiu Descriptio Legth Modelig M M-closed M-ope Do you believe that the data geeratig echais really is i your odel class M? 7 73 Miiu Descriptio Legth Priciple o-m-closed predictive iferece

More information

AN EFFICIENT ESTIMATION METHOD FOR THE PARETO DISTRIBUTION

AN EFFICIENT ESTIMATION METHOD FOR THE PARETO DISTRIBUTION Joural of Statistics: Advaces i Theory ad Applicatios Volue 3, Nuber, 00, Pages 6-78 AN EFFICIENT ESTIMATION METHOD FOR THE PARETO DISTRIBUTION Departet of Matheatics Brock Uiversity St. Catharies, Otario

More information

A string of not-so-obvious statements about correlation in the data. (This refers to the mechanical calculation of correlation in the data.

A string of not-so-obvious statements about correlation in the data. (This refers to the mechanical calculation of correlation in the data. STAT-UB.003 NOTES for Wedesday 0.MAY.0 We will use the file JulieApartet.tw. We ll give the regressio of Price o SqFt, show residual versus fitted plot, save residuals ad fitted. Give plot of (Resid, Price,

More information

We have also learned that, thanks to the Central Limit Theorem and the Law of Large Numbers,

We have also learned that, thanks to the Central Limit Theorem and the Law of Large Numbers, Cofidece Itervals III What we kow so far: We have see how to set cofidece itervals for the ea, or expected value, of a oral probability distributio, both whe the variace is kow (usig the stadard oral,

More information

CS 70 Second Midterm 7 April NAME (1 pt): SID (1 pt): TA (1 pt): Name of Neighbor to your left (1 pt): Name of Neighbor to your right (1 pt):

CS 70 Second Midterm 7 April NAME (1 pt): SID (1 pt): TA (1 pt): Name of Neighbor to your left (1 pt): Name of Neighbor to your right (1 pt): CS 70 Secod Midter 7 April 2011 NAME (1 pt): SID (1 pt): TA (1 pt): Nae of Neighbor to your left (1 pt): Nae of Neighbor to your right (1 pt): Istructios: This is a closed book, closed calculator, closed

More information

Achieving Stationary Distributions in Markov Chains. Monday, November 17, 2008 Rice University

Achieving Stationary Distributions in Markov Chains. Monday, November 17, 2008 Rice University Istructor: Achievig Statioary Distributios i Markov Chais Moday, November 1, 008 Rice Uiversity Dr. Volka Cevher STAT 1 / ELEC 9: Graphical Models Scribe: Rya E. Guerra, Tahira N. Saleem, Terrace D. Savitsky

More information

Pseudo-marginal Metropolis-Hastings: a simple explanation and (partial) review of theory

Pseudo-marginal Metropolis-Hastings: a simple explanation and (partial) review of theory Pseudo-arginal Metropolis-Hastings: a siple explanation and (partial) review of theory Chris Sherlock Motivation Iagine a stochastic process V which arises fro soe distribution with density p(v θ ). Iagine

More information

Bayesian inference for stochastic differential mixed effects models - initial steps

Bayesian inference for stochastic differential mixed effects models - initial steps Bayesian inference for stochastic differential ixed effects odels - initial steps Gavin Whitaker 2nd May 2012 Supervisors: RJB and AG Outline Mixed Effects Stochastic Differential Equations (SDEs) Bayesian

More information

Some Basic Probability Concepts. 2.1 Experiments, Outcomes and Random Variables

Some Basic Probability Concepts. 2.1 Experiments, Outcomes and Random Variables Some Basic Probability Cocepts 2. Experimets, Outcomes ad Radom Variables A radom variable is a variable whose value is ukow util it is observed. The value of a radom variable results from a experimet;

More information

The Bayesian Learning Framework. Back to Maximum Likelihood. Naïve Bayes. Simple Example: Coin Tosses. Given a generative model

The Bayesian Learning Framework. Back to Maximum Likelihood. Naïve Bayes. Simple Example: Coin Tosses. Given a generative model Back to Maximum Likelihood Give a geerative model f (x, y = k) =π k f k (x) Usig a geerative modellig approach, we assume a parametric form for f k (x) =f (x; k ) ad compute the MLE θ of θ =(π k, k ) k=

More information

LECTURE NOTES 9. 1 Point Estimation. 1.1 The Method of Moments

LECTURE NOTES 9. 1 Point Estimation. 1.1 The Method of Moments LECTURE NOTES 9 Poit Estimatio Uder the hypothesis that the sample was geerated from some parametric statistical model, a atural way to uderstad the uderlyig populatio is by estimatig the parameters of

More information

ECE 901 Lecture 4: Estimation of Lipschitz smooth functions

ECE 901 Lecture 4: Estimation of Lipschitz smooth functions ECE 9 Lecture 4: Estiatio of Lipschitz sooth fuctios R. Nowak 5/7/29 Cosider the followig settig. Let Y f (X) + W, where X is a rado variable (r.v.) o X [, ], W is a r.v. o Y R, idepedet of X ad satisfyig

More information

arxiv: v1 [math.st] 12 Dec 2018

arxiv: v1 [math.st] 12 Dec 2018 DIVERGENCE MEASURES ESTIMATION AND ITS ASYMPTOTIC NORMALITY THEORY : DISCRETE CASE arxiv:181.04795v1 [ath.st] 1 Dec 018 Abstract. 1) BA AMADOU DIADIÉ AND 1,,4) LO GANE SAMB 1. Itroductio 1.1. Motivatios.

More information

Tomoki Toda. Augmented Human Communication Laboratory Graduate School of Information Science

Tomoki Toda. Augmented Human Communication Laboratory Graduate School of Information Science Seuetial Data Modelig d class Basics of seuetial data odelig ooki oda Augeted Hua Couicatio Laboratory Graduate School of Iforatio Sciece Basic Aroaches How to efficietly odel joit robability of high diesioal

More information

Unbiased Estimation. February 7-12, 2008

Unbiased Estimation. February 7-12, 2008 Ubiased Estimatio February 7-2, 2008 We begi with a sample X = (X,..., X ) of radom variables chose accordig to oe of a family of probabilities P θ where θ is elemet from the parameter space Θ. For radom

More information

Exponential Families and Bayesian Inference

Exponential Families and Bayesian Inference Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where

More information

Lecture 19. Curve fitting I. 1 Introduction. 2 Fitting a constant to measured data

Lecture 19. Curve fitting I. 1 Introduction. 2 Fitting a constant to measured data Lecture 9 Curve fittig I Itroductio Suppose we are preseted with eight poits of easured data (x i, y j ). As show i Fig. o the left, we could represet the uderlyig fuctio of which these data are saples

More information

Probability and MLE.

Probability and MLE. 10-701 Probability ad MLE http://www.cs.cmu.edu/~pradeepr/701 (brief) itro to probability Basic otatios Radom variable - referrig to a elemet / evet whose status is ukow: A = it will rai tomorrow Domai

More information

HOMEWORK I: PREREQUISITES FROM MATH 727

HOMEWORK I: PREREQUISITES FROM MATH 727 HOMEWORK I: PREREQUISITES FROM MATH 727 Questio. Let X, X 2,... be idepedet expoetial radom variables with mea µ. (a) Show that for Z +, we have EX µ!. (b) Show that almost surely, X + + X (c) Fid the

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

Statistics and Data Analysis in MATLAB Kendrick Kay, February 28, Lecture 4: Model fitting

Statistics and Data Analysis in MATLAB Kendrick Kay, February 28, Lecture 4: Model fitting Statistics ad Data Aalysis i MATLAB Kedrick Kay, kedrick.kay@wustl.edu February 28, 2014 Lecture 4: Model fittig 1. The basics - Suppose that we have a set of data ad suppose that we have selected the

More information

The Binomial Multi-Section Transformer

The Binomial Multi-Section Transformer 4/15/2010 The Bioial Multisectio Matchig Trasforer preset.doc 1/24 The Bioial Multi-Sectio Trasforer Recall that a ulti-sectio atchig etwork ca be described usig the theory of sall reflectios as: where:

More information

Chapter 2. Asymptotic Notation

Chapter 2. Asymptotic Notation Asyptotic Notatio 3 Chapter Asyptotic Notatio Goal : To siplify the aalysis of ruig tie by gettig rid of details which ay be affected by specific ipleetatio ad hardware. [1] The Big Oh (O-Notatio) : It

More information

Summary. Recap ... Last Lecture. Summary. Theorem

Summary. Recap ... Last Lecture. Summary. Theorem Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca

More information

Asymptotics. Hypothesis Testing UMP. Asymptotic Tests and p-values

Asymptotics. Hypothesis Testing UMP. Asymptotic Tests and p-values of the secod half Biostatistics 6 - Statistical Iferece Lecture 6 Fial Exam & Practice Problems for the Fial Hyu Mi Kag Apil 3rd, 3 Hyu Mi Kag Biostatistics 6 - Lecture 6 Apil 3rd, 3 / 3 Rao-Blackwell

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

CS Lecture 13. More Maximum Likelihood

CS Lecture 13. More Maximum Likelihood CS 6347 Lecture 13 More Maxiu Likelihood Recap Last tie: Introduction to axiu likelihood estiation MLE for Bayesian networks Optial CPTs correspond to epirical counts Today: MLE for CRFs 2 Maxiu Likelihood

More information

Will Monroe August 9, with materials by Mehran Sahami and Chris Piech. image: Arito. Parameter learning

Will Monroe August 9, with materials by Mehran Sahami and Chris Piech. image: Arito. Parameter learning Will Monroe August 9, 07 with aterials by Mehran Sahai and Chris Piech iage: Arito Paraeter learning Announceent: Proble Set #6 Goes out tonight. Due the last day of class, Wednesday, August 6 (before

More information

Introduction to Optimization, DIKU Monday 19 November David Pisinger. Duality, motivation

Introduction to Optimization, DIKU Monday 19 November David Pisinger. Duality, motivation Itroductio to Optiizatio, DIKU 007-08 Moday 9 Noveber David Pisiger Lecture, Duality ad sesitivity aalysis Duality, shadow prices, sesitivity aalysis, post-optial aalysis, copleetary slackess, KKT optiality

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

Hypothesis tests and confidence intervals

Hypothesis tests and confidence intervals Hypothesis tests ad cofidece itervals The 95% cofidece iterval for µ is the set of values, µ 0, such that the ull hypothesis H 0 : µ = µ 0 would ot be rejected by a two-sided test with α = 5%. The 95%

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tau.edu/~suhasii/teachig.htl Suhasii Subba Rao Exaple The itroge cotet of three differet clover plats is give below. 3DOK1 3DOK5 3DOK7

More information

Learning Theory for Conditional Risk Minimization: Supplementary Material

Learning Theory for Conditional Risk Minimization: Supplementary Material Learig Theory for Coditioal Risk Miiizatio: Suppleetary Material Alexader Zii IST Austria azii@istacat Christoph H Lapter IST Austria chl@istacat Proofs Proof of Theore After the applicatio of (6) ad (8)

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Chapter 9 Maximum Likelihood Estimatio 9.1 The Likelihood Fuctio The maximum likelihood estimator is the most widely used estimatio method. This chapter discusses the most importat cocepts behid maximum

More information

Recursive Updating Fixed Parameter

Recursive Updating Fixed Parameter Recursive Udatig Fixed Parameter So far we ve cosidered the sceario where we collect a buch of data ad the use that (ad the rior PDF) to comute the coditioal PDF from which we ca the get the MAP, or ay

More information

5.6 Binomial Multi-section Matching Transformer

5.6 Binomial Multi-section Matching Transformer 4/14/21 5_6 Bioial Multisectio Matchig Trasforers 1/1 5.6 Bioial Multi-sectio Matchig Trasforer Readig Assiget: pp. 246-25 Oe way to axiize badwidth is to costruct a ultisectio Γ f that is axially flat.

More information

Mixture models (cont d)

Mixture models (cont d) 6.867 Machie learig, lecture 5 (Jaakkola) Lecture topics: Differet types of ixture odels (cot d) Estiatig ixtures: the EM algorith Mixture odels (cot d) Basic ixture odel Mixture odels try to capture ad

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Generalized Semi- Markov Processes (GSMP)

Generalized Semi- Markov Processes (GSMP) Geeralized Semi- Markov Processes (GSMP) Summary Some Defiitios Markov ad Semi-Markov Processes The Poisso Process Properties of the Poisso Process Iterarrival times Memoryless property ad the residual

More information

Sample size calculations. $ available $ per sample

Sample size calculations. $ available $ per sample Saple size calculatios = $ available $ per saple Too few aials A total waste Too ay aials A partial waste Power X 1,...,X iid oralµ A, A Y 1,...,Y iid oralµ B, B Test H 0 : µ A = µ B vs H a : µ A µ B at

More information

The Hypergeometric Coupon Collection Problem and its Dual

The Hypergeometric Coupon Collection Problem and its Dual Joural of Idustrial ad Systes Egieerig Vol., o., pp -7 Sprig 7 The Hypergeoetric Coupo Collectio Proble ad its Dual Sheldo M. Ross Epstei Departet of Idustrial ad Systes Egieerig, Uiversity of Souther

More information

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes. Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely

More information

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS Jauary 25, 207 INTRODUCTION TO MATHEMATICAL STATISTICS Abstract. A basic itroductio to statistics assumig kowledge of probability theory.. Probability I a typical udergraduate problem i probability, we

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

5.6 Binomial Multi-section Matching Transformer

5.6 Binomial Multi-section Matching Transformer 4/14/2010 5_6 Bioial Multisectio Matchig Trasforers 1/1 5.6 Bioial Multi-sectio Matchig Trasforer Readig Assiget: pp. 246-250 Oe way to axiize badwidth is to costruct a ultisectio Γ f that is axially flat.

More information

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam. Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the

More information

A PROBABILITY PROBLEM

A PROBABILITY PROBLEM A PROBABILITY PROBLEM A big superarket chai has the followig policy: For every Euros you sped per buy, you ear oe poit (suppose, e.g., that = 3; i this case, if you sped 8.45 Euros, you get two poits,

More information

Acoustic Field inside a Rigid Cylinder with a Point Source

Acoustic Field inside a Rigid Cylinder with a Point Source Acoustic Field iside a Rigid Cylider with a Poit Source 1 Itroductio The ai objectives of this Deo Model are to Deostrate the ability of Coustyx to odel a rigid cylider with a poit source usig Coustyx

More information

Machine Learning Basics: Estimators, Bias and Variance

Machine Learning Basics: Estimators, Bias and Variance Machine Learning Basics: Estiators, Bias and Variance Sargur N. srihari@cedar.buffalo.edu This is part of lecture slides on Deep Learning: http://www.cedar.buffalo.edu/~srihari/cse676 1 Topics in Basics

More information

Mathematical Statistics Anna Janicka

Mathematical Statistics Anna Janicka Mathematical Statistics Aa Jaicka Lecture XIV, 5.06.07 BAYESIAN STATISTICS Pla for Today. BayesiaStatistics a priori ad a posteriori distributios Bayesia estimatio: Maximum a posteriori probability(map)

More information

Solutions: Homework 3

Solutions: Homework 3 Solutios: Homework 3 Suppose that the radom variables Y,...,Y satisfy Y i = x i + " i : i =,..., IID where x,...,x R are fixed values ad ",...," Normal(0, )with R + kow. Fid ˆ = MLE( ). IND Solutio: Observe

More information

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >

More information

Verification of continuous predictands

Verification of continuous predictands barbara.casati@ec.gc.ca Verificatio of cotiuous predictads Barbara Casati 9 Ja 007 Exploratory ethods: joit distributio Scatter-plot: plot of observatio versus forecast values Perfect forecast obs, poits

More information

Lecture 12: September 27

Lecture 12: September 27 36-705: Itermediate Statistics Fall 207 Lecturer: Siva Balakrisha Lecture 2: September 27 Today we will discuss sufficiecy i more detail ad the begi to discuss some geeral strategies for costructig estimators.

More information

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].

Probability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)]. Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x

More information

SDS 321: Introduction to Probability and Statistics

SDS 321: Introduction to Probability and Statistics SDS 321: Itroductio to Probability ad Statistics Lecture 23: Cotiuous radom variables- Iequalities, CLT Puramrita Sarkar Departmet of Statistics ad Data Sciece The Uiversity of Texas at Austi www.cs.cmu.edu/

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

November 2002 Course 4 solutions

November 2002 Course 4 solutions November Course 4 solutios Questio # Aswer: B φ ρ = = 5. φ φ ρ = φ + =. φ Solvig simultaeously gives: φ = 8. φ = 6. Questio # Aswer: C g = [(.45)] = [5.4] = 5; h= 5.4 5 =.4. ˆ π =.6 x +.4 x =.6(36) +.4(4)

More information

SECTION 2.6 THE SECOND ALTERNATIVE

SECTION 2.6 THE SECOND ALTERNATIVE 54 SECTION 2.6 THE SECOND ALTERNATIVE We ow discuss the probles where the Secod Alterative holds. The suppositio is that there is a otrivial solutio for L(y) =, B (y) = B 2 (y) =. The Fredhol Theores assure

More information

Distributional Similarity Models (cont.)

Distributional Similarity Models (cont.) Sematic Similarity Vector Space Model Similarity Measures cosie Euclidea distace... Clusterig k-meas hierarchical Last Time EM Clusterig Soft versio of K-meas clusterig Iput: m dimesioal objects X = {

More information

CSCI-6971 Lecture Notes: Stochastic processes

CSCI-6971 Lecture Notes: Stochastic processes CSCI-6971 Lecture Notes: Stochastic processes Kristopher R. Beevers Departet of Coputer Sciece Resselaer Polytechic Istitute beevek@cs.rpi.edu February 2, 2006 1 Overview Defiitio 1.1. A stochastic process

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

STATISTICAL INFERENCE

STATISTICAL INFERENCE STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample

More information

Binomial transform of products

Binomial transform of products Jauary 02 207 Bioial trasfor of products Khristo N Boyadzhiev Departet of Matheatics ad Statistics Ohio Norther Uiversity Ada OH 4580 USA -boyadzhiev@ouedu Abstract Give the bioial trasfors { b } ad {

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

x = Pr ( X (n) βx ) =

x = Pr ( X (n) βx ) = Exercise 93 / page 45 The desity of a variable X i i 1 is fx α α a For α kow let say equal to α α > fx α α x α Pr X i x < x < Usig a Pivotal Quatity: x α 1 < x < α > x α 1 ad We solve i a similar way as

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

Optimal Estimator for a Sample Set with Response Error. Ed Stanek

Optimal Estimator for a Sample Set with Response Error. Ed Stanek Optial Estiator for a Saple Set wit Respose Error Ed Staek Itroductio We develop a optial estiator siilar to te FP estiator wit respose error tat was cosidered i c08ed63doc Te first 6 pages of tis docuet

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Distributional Similarity Models (cont.)

Distributional Similarity Models (cont.) Distributioal Similarity Models (cot.) Regia Barzilay EECS Departmet MIT October 19, 2004 Sematic Similarity Vector Space Model Similarity Measures cosie Euclidea distace... Clusterig k-meas hierarchical

More information

Pattern Classification

Pattern Classification Patter Classificatio All materials i these slides were tae from Patter Classificatio (d ed) by R. O. Duda, P. E. Hart ad D. G. Stor, Joh Wiley & Sos, 000 with the permissio of the authors ad the publisher

More information

Basics of Inference. Lecture 21: Bayesian Inference. Review - Example - Defective Parts, cont. Review - Example - Defective Parts

Basics of Inference. Lecture 21: Bayesian Inference. Review - Example - Defective Parts, cont. Review - Example - Defective Parts Basics of Iferece Lecture 21: Sta230 / Mth230 Coli Rudel Aril 16, 2014 U util this oit i the class you have almost exclusively bee reseted with roblems where we are usig a robability model where the model

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5 CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

Comparison of Methods for Estimation of Sample Sizes under the Weibull Distribution

Comparison of Methods for Estimation of Sample Sizes under the Weibull Distribution Iteratioal Joural of Applied Egieerig Research ISSN 0973-4562 Volume 12, Number 24 (2017) pp. 14273-14278 Research Idia Publicatios. http://www.ripublicatio.com Compariso of Methods for Estimatio of Sample

More information

Lektion 4. Sources of variation. Attribute Chapter 6 Attribute control

Lektion 4. Sources of variation. Attribute Chapter 6 Attribute control Lektio 4 27--2 Chapter 6 Attribute cotrol Sources of variatio Chace causes (Slupässiga källor) Rado variatio Backgroud oise Statistical cotrol, stable process Assigable causes (Systeatiska källor) There

More information

Lecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett

Lecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett Lecture Note 8 Poit Estimators ad Poit Estimatio Methods MIT 14.30 Sprig 2006 Herma Beett Give a parameter with ukow value, the goal of poit estimatio is to use a sample to compute a umber that represets

More information

3 Markov Chain Monte Carlo

3 Markov Chain Monte Carlo 3 Markov Chai Mote Carlo This is a self-cotaied itroductio to Markov Chai Mote Carlo (MCMC), a samplig method for estimatig expectatios which has revolutioized the practice of Bayesia iferece. There are

More information

) is a square matrix with the property that for any m n matrix A, the product AI equals A. The identity matrix has a ii

) is a square matrix with the property that for any m n matrix A, the product AI equals A. The identity matrix has a ii square atrix is oe that has the sae uber of rows as colus; that is, a atrix. he idetity atrix (deoted by I, I, or [] I ) is a square atrix with the property that for ay atrix, the product I equals. he

More information

Probability Distributions

Probability Distributions Probability Distributions In Chapter, we ephasized the central role played by probability theory in the solution of pattern recognition probles. We turn now to an exploration of soe particular exaples

More information

Lecture 9: September 19

Lecture 9: September 19 36-700: Probability ad Mathematical Statistics I Fall 206 Lecturer: Siva Balakrisha Lecture 9: September 9 9. Review ad Outlie Last class we discussed: Statistical estimatio broadly Pot estimatio Bias-Variace

More information

Probability Theory. Exercise Sheet 4. ETH Zurich HS 2017

Probability Theory. Exercise Sheet 4. ETH Zurich HS 2017 ETH Zurich HS 2017 D-MATH, D-PHYS Prof. A.-S. Szita Coordiator Yili Wag Probability Theory Exercise Sheet 4 Exercise 4.1 Let X ) N be a sequece of i.i.d. rado variables i a probability space Ω, A, P ).

More information

Non-Homogeneous Poisson Processes Applied to Count Data: A Bayesian Approach Considering Different Prior Distributions

Non-Homogeneous Poisson Processes Applied to Count Data: A Bayesian Approach Considering Different Prior Distributions Joural of Evirometal Protectio, 202, 3, 336-345 http://dx.doi.org/0.4236/jep.202.3052 Published Olie October 202 (http://www.scirp.org/joural/jep) No-Homogeeous Poisso Processes Applied to Cout Data: A

More information

Non-asymptotic sequential confidence regions with fixed sizes for the multivariate nonlinear parameters of regression. Andrey V.

Non-asymptotic sequential confidence regions with fixed sizes for the multivariate nonlinear parameters of regression. Andrey V. No-asyptotic sequetial cofidece regios with fixed sizes for the ultivariate oliear paraeters of regressio Adrey V Tiofeev Abstract I this paper we cosider a sequetial desig for estiatio of o-liear paraeters

More information

The Binomial Multi- Section Transformer

The Binomial Multi- Section Transformer 4/4/26 The Bioial Multisectio Matchig Trasforer /2 The Bioial Multi- Sectio Trasforer Recall that a ulti-sectio atchig etwork ca be described usig the theory of sall reflectios as: where: ( ω ) = + e +

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information