Some Examples on Gibbs Sampling and Metropolis-Hastings methods
|
|
- Edwina Watts
- 5 years ago
- Views:
Transcription
1 Soe Exaples o Gibbs Saplig ad Metropolis-Hastigs ethods S420/620 Itroductio to Statistical Theory, Fall 2012
2 Gibbs Sapler Saple a ultidiesioal probability distributio fro coditioal desities. Suppose d = 2, θ = (θ 1, θ 2 ). Set a iitial poit, θ (0) = (θ (0) 1, θ(0) 2 ). 1 Geerate a value θ (1) 2 fro π(θ 2 θ 1 = θ (0) 1, X = x). 2 Geerate a value θ (1) 1 fro π(θ 1 θ 2 = θ (1) 2, X = x). Two steps give a ew value θ (1) = (θ (0) 1, θ(0) 2 ) After several iteratios: θ (1), θ (2), θ (3),..., Saples (after traiig period) correspod to π(θ 1, θ 2 x). S420/620 Itroductio to Statistical Theory, Fall 2012
3 Exaple: Bioial-Beta Oe observatio, X Bioial(, θ) ad π(θ) = Beta(a, b) Bayes theore, π(θ x) θ x (1 θ) x θ a 1 (1 θ) b 1 The joit desity for (x, θ), ( ) f (x; θ)π(θ) = θ x x Γ(a + b) (1 θ) x Γ(a)Γ(b) θa 1 (1 θ) b 1 What is the argial distributio of X? Its a Bioial-Beta distributio, ( ) Γ(a + b) Γ(a + x)γ(b + x) f (x) =, x = 0, 1, 2,..., x Γ(a)Γ(b) Γ(a + b + ) S420/620 Itroductio to Statistical Theory, Fall 2012
4 Via Gibbs saplig. Set iitial value (x (0), θ (0) ). For i = 1, 2,... M 1 Saple x (i) f (x θ (i 1) ) = Bioial(, θ (i 1) ) 2 Saple θ (i) π(θ x (i) ) = Beta(a + x (i), b + x (i) ) 3 Repeat (1) ad (2) ay ties. The process produces saples of (x, θ). Approxiate f (x) with the values x (i) For exaple, f (x) = P(X = x) (# of X (i) = x)/m We ay also, (1) Saple θ (i) π(θ x (i 1) ) (2) Saple x (i) f (x θ (i) ). S420/620 Itroductio to Statistical Theory, Fall 2012
5 # Exaple 1:Beta-Bioial siulatio # Prior paraeters a=2; b=4; =16 # prepare objects to save iteratios it =1500 x= rep(na,it); th=rep(na,it) # set iitial value x[1]=1; th[1]=0.5 # Perfor Gibbs iteratios for (i i 2:it) { x[i] =rbio(1,size=,prob=th[i-1]) th[i] = rbeta(1,a+x[i],b+-x[i]) } S420/620 Itroductio to Statistical Theory, Fall 2012
6 Trace plots for siulatios of x ad θ x iteratio th iteratio S420/620 Itroductio to Statistical Theory, Fall 2012
7 Histogra x values (f(x)) Desity x S420/620 Itroductio to Statistical Theory, Fall 2012
8 Histogra θ values (π(θ) ) Desity theta S420/620 Itroductio to Statistical Theory, Fall 2012
9 I d-diesios, we iteratively use π i (θ i θ i, x) where θ i = (θ 1, θ 2,..., θ i1, θ i+1,..., θ d ). Gibbs saplig is a exaple of Markov Chai Mote Carlo (MCMC) ethods. A ew poit θ (s+1) depeds o θ (s) Mote Carlo: pseudo-rado values. The liit or statioary distributio is π(θ x). Gibbs saplig assues direct siulatio fro full coditioals. If full coditioal siulatio is ot possible, the we could use Metropolis-Hastigs. S420/620 Itroductio to Statistical Theory, Fall 2012
10 Exaple: Bioial-Beta-Poisso Treat as ukow with π() = Poisso(λ), (λ kow). X Bioial(, θ); π(θ) = Beta(a, b) The joit distributio for (X, θ, ) is: ( ) ) θ x x Γ(a + b) (1 θ) x Γ(a)Γ(b) θa 1 (1 θ) (e b 1 λ λ! x = 0, 1,..., ; 0 < θ < 1; = 0, 1, 2,.... Agai, iterested o f (x) but ipossible to fid i aalytic-closed for. Alteratively (get rid of costats), f (θ, x, ) ( x )θ a+x 1 (1 θ) b+ x 1 λ! S420/620 Itroductio to Statistical Theory, Fall 2012
11 For Gibbs Saplig, fid full coditioals: f (x θ, ) ( x) θ x (1 θ) x Bioial(, θ), π(θ x, ) θ a+x 1 (1 θ) b+ x 1 Beta(a + x, b + x), ) λ π( θ, x) ( x! (1 θ) x [λ(1 θ)] x ( x)! ; = x, x + 1, x + 2,..., If we set, z = x, the z Poisso(λ(1 θ)) Set (x (0), θ (0), (0) ). For i = 1, 2, 3,..., Saple x (i) Bioial( (i 1), θ (i 1) ). Saple θ (i) Beta(a + x (i), b + (i 1) x (i) ). Saple (i) = x (i) + z, z Poisso(λ(1 θ (i) )) Repeat util covergece is reached. S420/620 Itroductio to Statistical Theory, Fall 2012
12 Trace plot ad histogra of values iteratio Histogra of Desity S420/620 Itroductio to Statistical Theory, Fall 2012
13 Histogra of x values Desity x S420/620 Itroductio to Statistical Theory, Fall 2012
14 Exaple with Bivariate Noral Distributio x = (x 1, x 2 ), µ = (µ 1, µ 2 ) ad Σ is a 2 2 covariace atrix with diagoal etries σ1 2, σ2 2 ad off-diagoals σ 1,2. The pdf is ( f (x µ, Σ) Σ 1/2 exp 1 ) 2 (x µ)t Σ 1 (x µ) Margial distributios: x 1 N(µ 1, σ 2 1 ) ad x 2 N(µ 2, σ 2 2 ). For Gibbs saplig, we eed f (x 1 x 2 ) ad f (x 2 x 1 ). I fact f (x 1 x 2 ) = N(µ 1 + (σ 1,2 /σ 2 2 )(x 2 µ 2 ), σ 2 1 (σ 1,2/σ 2 ) 2 ) f (x 2 x 1 ) = N(µ 2 + (σ 1,2 /σ 2 1 )(x 1 µ 1 ), σ 2 2 (σ 1,2/σ 1 ) 2 ) Exaple with µ = (2, 1), σ 2 1 = σ2 2 = 1 ad σ 1,2 = 0.7. S420/620 Itroductio to Statistical Theory, Fall 2012
15 Trace plots for siulatios of x 1 ad x 2 xs[, 1] Idex xs[, 2] Idex S420/620 Itroductio to Statistical Theory, Fall 2012
16 Histogras for siulatios of x 1 ad x 2 Histogra of xs[-(1:100), 1] Desity x1 Histogra of xs[-(1:100), 2] Desity x2 S420/620 Itroductio to Statistical Theory, Fall 2012
17 British coal iig disasters data o. of disasters tie Histogra of y Desity y S420/620 Itroductio to Statistical Theory, Fall 2012
18 Exaple: Poisso process with a Chage Poit Hierarchical odel proposed i Carli, Gelfad ad Sith (1992) (also see Sectio 3.10). First stage: X i Poisso(µ), i = 1, 2,..., k. X i Poisso(λ), i = k + 1, k + 2,..., so X i represets the observatios ad the paraeter of iterest are (µ, λ, k). Data cosists of coal-iig disasters i the U.K. Secod stage: Idepedet priors o (µ, λ, k). k discrete uifor o {1, 2,..., }, =saple size. µ Ga(a 1, b 1 ) ad λ Ga(a 2, b 2 ) a 1, b 1, a 2, b 2 are fixed. S420/620 Itroductio to Statistical Theory, Fall 2012
19 The joit posterior distributio for (µ, λ, k) has the for: π(µ, λ, k X) f (X µ, λ, k)π(µ)π(λ)π(k) The likelihood fuctio is: f (X θ, λ, k) = = k f (x i µ, k) f (x i λ, k) i=1 k µ x i e µ i=1 x i! i=k+1 i=k+1 λ x i e λ x i! Therefore, π(µ, λ, k X) k µ x i e µ i=1 x i! i=k+1 λ x i e λ x i! ( ) (µ a1 1 e µb 1 )(λ a2 1 e λb 2 1 ) S420/620 Itroductio to Statistical Theory, Fall 2012
20 Or equivaletly, π(µ, λ, k X) µ a 1+ k i=1 x i 1 e µ(k+b 1) λ a 2+ i=k+1 x i 1 e λ( k+b 2) Full coditioal distributios, 1 π(µ λ, k, X) µ a 1+ k i=1 x i 1 e µ(k+b1) = Ga(a 1 + k i=1 x i, k + b 1 ) 2 π(λ µ, k, X) λ a 2+ i=k+1 x i 1 e λ( k+b2) = Ga(a 2 + i=k+1 x i, k + b 2 ) 3 π(k µ, λ, X) µ k i=1 x i λ i=k+1 x i e k(µ λ) L(X µ, λ, k) So, p(k µ, λ, X) = L(X µ,λ,k) k=1, k = 1, 2,..., L(X µ,λ,k) Soe results with the British Coal iig disaster data follow. S420/620 Itroductio to Statistical Theory, Fall 2012
21 k Series draws[, i] Histogra of draws[, i] theta ACF Desity iteratio Lag theta Series draws[, i] Histogra of draws[, i] labda ACF Desity iteratio Lag labda Series draws[, i] Histogra of draws[, i] ACF Desity iteratio Lag k S420/620 Itroductio to Statistical Theory, Fall 2012
22 Metropolis-Hastigs Algorith (cotiuous case) Saple (idirectly) a value X fro a pdf f (x). I Bayesia probles f (x) is the posterior distributio. Need a trial or proposal desity q(x, y) 0. 1 Set a iitial value X (0). For = 1, 2,..., 2 Give X () = x, geerate Y = y, fro q(x, y). 3 Copute, { α = i 1, } f (y)q(x, y) f (x)q(x, y) 4 U U(0, 1). If U α, the X (+1) = y. Otherwise, X (+1) = x 5 Chage to + 1 ad go back to 2. S420/620 Itroductio to Statistical Theory, Fall 2012
23 Coo to ru algorith say 1000 iteratios ad delete the first, say 100 (bur-i). Uder a syetric proposal (Metropolis), q(x, y) = q(y, x) ad { α = i 1, f (y) } f (x) Rado walk proposal, h is a scalig paraeter. y = x + z; z N(0, h 2 ). h is specified so that 40% 60% acceptace rate is achieved. S420/620 Itroductio to Statistical Theory, Fall 2012
24 Exaple with Bivariate Noral Distributio θ = (θ 1, θ 2 ), µ = (µ 1, µ 2 ) ad S is a 2 2 covariace atrix with diagoal etries s 1,1, s 2,2 ad off-diagoals s 1,2 = s 2,1. The pdf is ( π(θ) S 1/2 exp 1 ) 2 (θ µ)t S 1 (θ µ) Margial distributios: θ 1 N(µ 1, s 1,1 ) ad θ 2 N(µ 2, s 2,2 ). Exaple with µ = (2, 1), s 1,1 = s 2,2 = 1 ad s 1,2 = 0.7. S420/620 Itroductio to Statistical Theory, Fall 2012
25 Trial or cadidate desity q(θ, θ ), also a bivariate oral so θ = θ + z; z N(0, Σ) with Σ a diagoal atrix with diagoal etries 0.6, 0.4. z 1 N(0, 0.6) ad z 2 N(0, 0.4). Rado walk proposal so q(θ, θ ) = q(θ, θ). The Metropolis ratio is: { } α(θ, θ ) = i 1, π(θ ) = π(θ) ( ) exp 1 = i 1, 2 (θ µ) t S 1 (θ µ) exp ( 1 2 (θ µ)t S 1 (θ µ) ) S420/620 Itroductio to Statistical Theory, Fall 2012
26 Fro a iitial poit θ (0). For i = 1, 2,..., θ = θ (i 1) + z The copute the acceptace probability, α(θ (i 1), θ ). Make θ (i) = θ if U < α(θ (i 1), θ ). Otherwise, θ (i) = θ (i 1). Fro θ (0) = ( 1, 3), we ll look at: Tie series plots of θ = (θ 1, θ 2 ) values. Histogras of argial saples for θ 1 ad θ 2. Bivariate plots of saples. S420/620 Itroductio to Statistical Theory, Fall 2012
27 theta iteratio theta iteratio S420/620 Itroductio to Statistical Theory, Fall 2012
28 Margial for Theta theta1 Margial for Theta theta2 S420/620 Itroductio to Statistical Theory, Fall 2012
29 2 0-2 theta siulatios fr o the Bivariate Noral theta theta siulatios fro the M-H algorith theta 1 S420/620 Itroductio to Statistical Theory, Fall
Statistics for Applications Fall Problem Set 7
18.650. Statistics for Applicatios Fall 016. Proble Set 7 Due Friday, Oct. 8 at 1 oo Proble 1 QQ-plots Recall that the Laplace distributio with paraeter λ > 0 is the cotiuous probaλ bility easure with
More informationDefine a Markov chain on {1,..., 6} with transition probability matrix P =
Pla Group Work 0. The title says it all Next Tie: MCMC ad Geeral-state Markov Chais Midter Exa: Tuesday 8 March i class Hoework 4 due Thursday Uless otherwise oted, let X be a irreducible, aperiodic Markov
More informationBayesian Methods: Introduction to Multi-parameter Models
Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationQueueing Theory II. Summary. M/M/1 Output process Networks of Queue Method of Stages. General Distributions
Queueig Theory II Suary M/M/1 Output process Networks of Queue Method of Stages Erlag Distributio Hyperexpoetial Distributio Geeral Distributios Ebedded Markov Chais 1 M/M/1 Output Process Burke s Theore:
More informationContents Two Sample t Tests Two Sample t Tests
Cotets 3.5.3 Two Saple t Tests................................... 3.5.3 Two Saple t Tests Setup: Two Saples We ow focus o a sceario where we have two idepedet saples fro possibly differet populatios. Our
More informationAPPLIED MULTIVARIATE ANALYSIS
ALIED MULTIVARIATE ANALYSIS FREQUENTLY ASKED QUESTIONS AMIT MITRA & SHARMISHTHA MITRA DEARTMENT OF MATHEMATICS & STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANUR X = X X X [] The variace covariace atrix
More informationModelling Missing Data. Missing Data Mechanism. Problem: Some data Y ij may be missing. Complete-data model:
Coplete-data odel: Modellig Missig Data idepedet ad idetically distributed () draws Y,, Y fro ultivariate distributio P θ, Y i = (Y i,, Y ip ) T P θ Data atrix: Y = (Y,, Y ) T = (Y ij ),,;j=,,p Uits Y
More informationLecture 11 and 12: Basic estimation theory
Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis
More informationStanford Statistics 311/Electrical Engineering 377
I. Uiversal predictio ad codig a. Gae: sequecex ofdata, adwattopredict(orcode)aswellasifwekew distributio of data b. Two versios: probabilistic ad adversarial. I either case, let p ad q be desities or
More informationEECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1
EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum
More informationSurveying the Variance Reduction Methods
Available olie at www.scizer.co Austria Joural of Matheatics ad Statistics, Vol 1, Issue 1, (2017): 10-15 ISSN 0000-0000 Surveyig the Variace Reductio Methods Arash Mirtorabi *1, Gholahossei Gholai 2 1.
More informationOn Modeling On Minimum Description Length Modeling. M-closed
O Modelig O Miiu Descriptio Legth Modelig M M-closed M-ope Do you believe that the data geeratig echais really is i your odel class M? 7 73 Miiu Descriptio Legth Priciple o-m-closed predictive iferece
More informationAN EFFICIENT ESTIMATION METHOD FOR THE PARETO DISTRIBUTION
Joural of Statistics: Advaces i Theory ad Applicatios Volue 3, Nuber, 00, Pages 6-78 AN EFFICIENT ESTIMATION METHOD FOR THE PARETO DISTRIBUTION Departet of Matheatics Brock Uiversity St. Catharies, Otario
More informationA string of not-so-obvious statements about correlation in the data. (This refers to the mechanical calculation of correlation in the data.
STAT-UB.003 NOTES for Wedesday 0.MAY.0 We will use the file JulieApartet.tw. We ll give the regressio of Price o SqFt, show residual versus fitted plot, save residuals ad fitted. Give plot of (Resid, Price,
More informationWe have also learned that, thanks to the Central Limit Theorem and the Law of Large Numbers,
Cofidece Itervals III What we kow so far: We have see how to set cofidece itervals for the ea, or expected value, of a oral probability distributio, both whe the variace is kow (usig the stadard oral,
More informationCS 70 Second Midterm 7 April NAME (1 pt): SID (1 pt): TA (1 pt): Name of Neighbor to your left (1 pt): Name of Neighbor to your right (1 pt):
CS 70 Secod Midter 7 April 2011 NAME (1 pt): SID (1 pt): TA (1 pt): Nae of Neighbor to your left (1 pt): Nae of Neighbor to your right (1 pt): Istructios: This is a closed book, closed calculator, closed
More informationAchieving Stationary Distributions in Markov Chains. Monday, November 17, 2008 Rice University
Istructor: Achievig Statioary Distributios i Markov Chais Moday, November 1, 008 Rice Uiversity Dr. Volka Cevher STAT 1 / ELEC 9: Graphical Models Scribe: Rya E. Guerra, Tahira N. Saleem, Terrace D. Savitsky
More informationPseudo-marginal Metropolis-Hastings: a simple explanation and (partial) review of theory
Pseudo-arginal Metropolis-Hastings: a siple explanation and (partial) review of theory Chris Sherlock Motivation Iagine a stochastic process V which arises fro soe distribution with density p(v θ ). Iagine
More informationBayesian inference for stochastic differential mixed effects models - initial steps
Bayesian inference for stochastic differential ixed effects odels - initial steps Gavin Whitaker 2nd May 2012 Supervisors: RJB and AG Outline Mixed Effects Stochastic Differential Equations (SDEs) Bayesian
More informationSome Basic Probability Concepts. 2.1 Experiments, Outcomes and Random Variables
Some Basic Probability Cocepts 2. Experimets, Outcomes ad Radom Variables A radom variable is a variable whose value is ukow util it is observed. The value of a radom variable results from a experimet;
More informationThe Bayesian Learning Framework. Back to Maximum Likelihood. Naïve Bayes. Simple Example: Coin Tosses. Given a generative model
Back to Maximum Likelihood Give a geerative model f (x, y = k) =π k f k (x) Usig a geerative modellig approach, we assume a parametric form for f k (x) =f (x; k ) ad compute the MLE θ of θ =(π k, k ) k=
More informationLECTURE NOTES 9. 1 Point Estimation. 1.1 The Method of Moments
LECTURE NOTES 9 Poit Estimatio Uder the hypothesis that the sample was geerated from some parametric statistical model, a atural way to uderstad the uderlyig populatio is by estimatig the parameters of
More informationECE 901 Lecture 4: Estimation of Lipschitz smooth functions
ECE 9 Lecture 4: Estiatio of Lipschitz sooth fuctios R. Nowak 5/7/29 Cosider the followig settig. Let Y f (X) + W, where X is a rado variable (r.v.) o X [, ], W is a r.v. o Y R, idepedet of X ad satisfyig
More informationarxiv: v1 [math.st] 12 Dec 2018
DIVERGENCE MEASURES ESTIMATION AND ITS ASYMPTOTIC NORMALITY THEORY : DISCRETE CASE arxiv:181.04795v1 [ath.st] 1 Dec 018 Abstract. 1) BA AMADOU DIADIÉ AND 1,,4) LO GANE SAMB 1. Itroductio 1.1. Motivatios.
More informationTomoki Toda. Augmented Human Communication Laboratory Graduate School of Information Science
Seuetial Data Modelig d class Basics of seuetial data odelig ooki oda Augeted Hua Couicatio Laboratory Graduate School of Iforatio Sciece Basic Aroaches How to efficietly odel joit robability of high diesioal
More informationUnbiased Estimation. February 7-12, 2008
Ubiased Estimatio February 7-2, 2008 We begi with a sample X = (X,..., X ) of radom variables chose accordig to oe of a family of probabilities P θ where θ is elemet from the parameter space Θ. For radom
More informationExponential Families and Bayesian Inference
Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where
More informationLecture 19. Curve fitting I. 1 Introduction. 2 Fitting a constant to measured data
Lecture 9 Curve fittig I Itroductio Suppose we are preseted with eight poits of easured data (x i, y j ). As show i Fig. o the left, we could represet the uderlyig fuctio of which these data are saples
More informationProbability and MLE.
10-701 Probability ad MLE http://www.cs.cmu.edu/~pradeepr/701 (brief) itro to probability Basic otatios Radom variable - referrig to a elemet / evet whose status is ukow: A = it will rai tomorrow Domai
More informationHOMEWORK I: PREREQUISITES FROM MATH 727
HOMEWORK I: PREREQUISITES FROM MATH 727 Questio. Let X, X 2,... be idepedet expoetial radom variables with mea µ. (a) Show that for Z +, we have EX µ!. (b) Show that almost surely, X + + X (c) Fid the
More informationECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015
ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],
More informationStatistics and Data Analysis in MATLAB Kendrick Kay, February 28, Lecture 4: Model fitting
Statistics ad Data Aalysis i MATLAB Kedrick Kay, kedrick.kay@wustl.edu February 28, 2014 Lecture 4: Model fittig 1. The basics - Suppose that we have a set of data ad suppose that we have selected the
More informationThe Binomial Multi-Section Transformer
4/15/2010 The Bioial Multisectio Matchig Trasforer preset.doc 1/24 The Bioial Multi-Sectio Trasforer Recall that a ulti-sectio atchig etwork ca be described usig the theory of sall reflectios as: where:
More informationChapter 2. Asymptotic Notation
Asyptotic Notatio 3 Chapter Asyptotic Notatio Goal : To siplify the aalysis of ruig tie by gettig rid of details which ay be affected by specific ipleetatio ad hardware. [1] The Big Oh (O-Notatio) : It
More informationSummary. Recap ... Last Lecture. Summary. Theorem
Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca
More informationAsymptotics. Hypothesis Testing UMP. Asymptotic Tests and p-values
of the secod half Biostatistics 6 - Statistical Iferece Lecture 6 Fial Exam & Practice Problems for the Fial Hyu Mi Kag Apil 3rd, 3 Hyu Mi Kag Biostatistics 6 - Lecture 6 Apil 3rd, 3 / 3 Rao-Blackwell
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationCS Lecture 13. More Maximum Likelihood
CS 6347 Lecture 13 More Maxiu Likelihood Recap Last tie: Introduction to axiu likelihood estiation MLE for Bayesian networks Optial CPTs correspond to epirical counts Today: MLE for CRFs 2 Maxiu Likelihood
More informationWill Monroe August 9, with materials by Mehran Sahami and Chris Piech. image: Arito. Parameter learning
Will Monroe August 9, 07 with aterials by Mehran Sahai and Chris Piech iage: Arito Paraeter learning Announceent: Proble Set #6 Goes out tonight. Due the last day of class, Wednesday, August 6 (before
More informationIntroduction to Optimization, DIKU Monday 19 November David Pisinger. Duality, motivation
Itroductio to Optiizatio, DIKU 007-08 Moday 9 Noveber David Pisiger Lecture, Duality ad sesitivity aalysis Duality, shadow prices, sesitivity aalysis, post-optial aalysis, copleetary slackess, KKT optiality
More information5. Likelihood Ratio Tests
1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,
More informationHypothesis tests and confidence intervals
Hypothesis tests ad cofidece itervals The 95% cofidece iterval for µ is the set of values, µ 0, such that the ull hypothesis H 0 : µ = µ 0 would ot be rejected by a two-sided test with α = 5%. The 95%
More informationData Analysis and Statistical Methods Statistics 651
Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tau.edu/~suhasii/teachig.htl Suhasii Subba Rao Exaple The itroge cotet of three differet clover plats is give below. 3DOK1 3DOK5 3DOK7
More informationLearning Theory for Conditional Risk Minimization: Supplementary Material
Learig Theory for Coditioal Risk Miiizatio: Suppleetary Material Alexader Zii IST Austria azii@istacat Christoph H Lapter IST Austria chl@istacat Proofs Proof of Theore After the applicatio of (6) ad (8)
More informationMaximum Likelihood Estimation
Chapter 9 Maximum Likelihood Estimatio 9.1 The Likelihood Fuctio The maximum likelihood estimator is the most widely used estimatio method. This chapter discusses the most importat cocepts behid maximum
More informationRecursive Updating Fixed Parameter
Recursive Udatig Fixed Parameter So far we ve cosidered the sceario where we collect a buch of data ad the use that (ad the rior PDF) to comute the coditioal PDF from which we ca the get the MAP, or ay
More information5.6 Binomial Multi-section Matching Transformer
4/14/21 5_6 Bioial Multisectio Matchig Trasforers 1/1 5.6 Bioial Multi-sectio Matchig Trasforer Readig Assiget: pp. 246-25 Oe way to axiize badwidth is to costruct a ultisectio Γ f that is axially flat.
More informationMixture models (cont d)
6.867 Machie learig, lecture 5 (Jaakkola) Lecture topics: Differet types of ixture odels (cot d) Estiatig ixtures: the EM algorith Mixture odels (cot d) Basic ixture odel Mixture odels try to capture ad
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More informationGeneralized Semi- Markov Processes (GSMP)
Geeralized Semi- Markov Processes (GSMP) Summary Some Defiitios Markov ad Semi-Markov Processes The Poisso Process Properties of the Poisso Process Iterarrival times Memoryless property ad the residual
More informationSample size calculations. $ available $ per sample
Saple size calculatios = $ available $ per saple Too few aials A total waste Too ay aials A partial waste Power X 1,...,X iid oralµ A, A Y 1,...,Y iid oralµ B, B Test H 0 : µ A = µ B vs H a : µ A µ B at
More informationThe Hypergeometric Coupon Collection Problem and its Dual
Joural of Idustrial ad Systes Egieerig Vol., o., pp -7 Sprig 7 The Hypergeoetric Coupo Collectio Proble ad its Dual Sheldo M. Ross Epstei Departet of Idustrial ad Systes Egieerig, Uiversity of Souther
More informationDirection: This test is worth 250 points. You are required to complete this test within 50 minutes.
Term Test October 3, 003 Name Math 56 Studet Number Directio: This test is worth 50 poits. You are required to complete this test withi 50 miutes. I order to receive full credit, aswer each problem completely
More informationJanuary 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS
Jauary 25, 207 INTRODUCTION TO MATHEMATICAL STATISTICS Abstract. A basic itroductio to statistics assumig kowledge of probability theory.. Probability I a typical udergraduate problem i probability, we
More informationEXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY
EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber
More information5.6 Binomial Multi-section Matching Transformer
4/14/2010 5_6 Bioial Multisectio Matchig Trasforers 1/1 5.6 Bioial Multi-sectio Matchig Trasforer Readig Assiget: pp. 246-250 Oe way to axiize badwidth is to costruct a ultisectio Γ f that is axially flat.
More informationThis exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.
Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the
More informationA PROBABILITY PROBLEM
A PROBABILITY PROBLEM A big superarket chai has the followig policy: For every Euros you sped per buy, you ear oe poit (suppose, e.g., that = 3; i this case, if you sped 8.45 Euros, you get two poits,
More informationAcoustic Field inside a Rigid Cylinder with a Point Source
Acoustic Field iside a Rigid Cylider with a Poit Source 1 Itroductio The ai objectives of this Deo Model are to Deostrate the ability of Coustyx to odel a rigid cylider with a poit source usig Coustyx
More informationMachine Learning Basics: Estimators, Bias and Variance
Machine Learning Basics: Estiators, Bias and Variance Sargur N. srihari@cedar.buffalo.edu This is part of lecture slides on Deep Learning: http://www.cedar.buffalo.edu/~srihari/cse676 1 Topics in Basics
More informationMathematical Statistics Anna Janicka
Mathematical Statistics Aa Jaicka Lecture XIV, 5.06.07 BAYESIAN STATISTICS Pla for Today. BayesiaStatistics a priori ad a posteriori distributios Bayesia estimatio: Maximum a posteriori probability(map)
More informationSolutions: Homework 3
Solutios: Homework 3 Suppose that the radom variables Y,...,Y satisfy Y i = x i + " i : i =,..., IID where x,...,x R are fixed values ad ",...," Normal(0, )with R + kow. Fid ˆ = MLE( ). IND Solutio: Observe
More informationFirst Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise
First Year Quatitative Comp Exam Sprig, 2012 Istructio: There are three parts. Aswer every questio i every part. Questio I-1 Part I - 203A A radom variable X is distributed with the margial desity: >
More informationVerification of continuous predictands
barbara.casati@ec.gc.ca Verificatio of cotiuous predictads Barbara Casati 9 Ja 007 Exploratory ethods: joit distributio Scatter-plot: plot of observatio versus forecast values Perfect forecast obs, poits
More informationLecture 12: September 27
36-705: Itermediate Statistics Fall 207 Lecturer: Siva Balakrisha Lecture 2: September 27 Today we will discuss sufficiecy i more detail ad the begi to discuss some geeral strategies for costructig estimators.
More informationProbability 2 - Notes 10. Lemma. If X is a random variable and g(x) 0 for all x in the support of f X, then P(g(X) 1) E[g(X)].
Probability 2 - Notes 0 Some Useful Iequalities. Lemma. If X is a radom variable ad g(x 0 for all x i the support of f X, the P(g(X E[g(X]. Proof. (cotiuous case P(g(X Corollaries x:g(x f X (xdx x:g(x
More informationSDS 321: Introduction to Probability and Statistics
SDS 321: Itroductio to Probability ad Statistics Lecture 23: Cotiuous radom variables- Iequalities, CLT Puramrita Sarkar Departmet of Statistics ad Data Sciece The Uiversity of Texas at Austi www.cs.cmu.edu/
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationNovember 2002 Course 4 solutions
November Course 4 solutios Questio # Aswer: B φ ρ = = 5. φ φ ρ = φ + =. φ Solvig simultaeously gives: φ = 8. φ = 6. Questio # Aswer: C g = [(.45)] = [5.4] = 5; h= 5.4 5 =.4. ˆ π =.6 x +.4 x =.6(36) +.4(4)
More informationSECTION 2.6 THE SECOND ALTERNATIVE
54 SECTION 2.6 THE SECOND ALTERNATIVE We ow discuss the probles where the Secod Alterative holds. The suppositio is that there is a otrivial solutio for L(y) =, B (y) = B 2 (y) =. The Fredhol Theores assure
More informationDistributional Similarity Models (cont.)
Sematic Similarity Vector Space Model Similarity Measures cosie Euclidea distace... Clusterig k-meas hierarchical Last Time EM Clusterig Soft versio of K-meas clusterig Iput: m dimesioal objects X = {
More informationCSCI-6971 Lecture Notes: Stochastic processes
CSCI-6971 Lecture Notes: Stochastic processes Kristopher R. Beevers Departet of Coputer Sciece Resselaer Polytechic Istitute beevek@cs.rpi.edu February 2, 2006 1 Overview Defiitio 1.1. A stochastic process
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationSTATISTICAL INFERENCE
STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample
More informationBinomial transform of products
Jauary 02 207 Bioial trasfor of products Khristo N Boyadzhiev Departet of Matheatics ad Statistics Ohio Norther Uiversity Ada OH 4580 USA -boyadzhiev@ouedu Abstract Give the bioial trasfors { b } ad {
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationx = Pr ( X (n) βx ) =
Exercise 93 / page 45 The desity of a variable X i i 1 is fx α α a For α kow let say equal to α α > fx α α x α Pr X i x < x < Usig a Pivotal Quatity: x α 1 < x < α > x α 1 ad We solve i a similar way as
More informationStat 319 Theory of Statistics (2) Exercises
Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.
More informationOptimal Estimator for a Sample Set with Response Error. Ed Stanek
Optial Estiator for a Saple Set wit Respose Error Ed Staek Itroductio We develop a optial estiator siilar to te FP estiator wit respose error tat was cosidered i c08ed63doc Te first 6 pages of tis docuet
More informationLecture 33: Bootstrap
Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationDistributional Similarity Models (cont.)
Distributioal Similarity Models (cot.) Regia Barzilay EECS Departmet MIT October 19, 2004 Sematic Similarity Vector Space Model Similarity Measures cosie Euclidea distace... Clusterig k-meas hierarchical
More informationPattern Classification
Patter Classificatio All materials i these slides were tae from Patter Classificatio (d ed) by R. O. Duda, P. E. Hart ad D. G. Stor, Joh Wiley & Sos, 000 with the permissio of the authors ad the publisher
More informationBasics of Inference. Lecture 21: Bayesian Inference. Review - Example - Defective Parts, cont. Review - Example - Defective Parts
Basics of Iferece Lecture 21: Sta230 / Mth230 Coli Rudel Aril 16, 2014 U util this oit i the class you have almost exclusively bee reseted with roblems where we are usig a robability model where the model
More informationCS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5
CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio
More information4. Partial Sums and the Central Limit Theorem
1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems
More informationComparison of Methods for Estimation of Sample Sizes under the Weibull Distribution
Iteratioal Joural of Applied Egieerig Research ISSN 0973-4562 Volume 12, Number 24 (2017) pp. 14273-14278 Research Idia Publicatios. http://www.ripublicatio.com Compariso of Methods for Estimatio of Sample
More informationLektion 4. Sources of variation. Attribute Chapter 6 Attribute control
Lektio 4 27--2 Chapter 6 Attribute cotrol Sources of variatio Chace causes (Slupässiga källor) Rado variatio Backgroud oise Statistical cotrol, stable process Assigable causes (Systeatiska källor) There
More informationLecture Note 8 Point Estimators and Point Estimation Methods. MIT Spring 2006 Herman Bennett
Lecture Note 8 Poit Estimators ad Poit Estimatio Methods MIT 14.30 Sprig 2006 Herma Beett Give a parameter with ukow value, the goal of poit estimatio is to use a sample to compute a umber that represets
More information3 Markov Chain Monte Carlo
3 Markov Chai Mote Carlo This is a self-cotaied itroductio to Markov Chai Mote Carlo (MCMC), a samplig method for estimatig expectatios which has revolutioized the practice of Bayesia iferece. There are
More information) is a square matrix with the property that for any m n matrix A, the product AI equals A. The identity matrix has a ii
square atrix is oe that has the sae uber of rows as colus; that is, a atrix. he idetity atrix (deoted by I, I, or [] I ) is a square atrix with the property that for ay atrix, the product I equals. he
More informationProbability Distributions
Probability Distributions In Chapter, we ephasized the central role played by probability theory in the solution of pattern recognition probles. We turn now to an exploration of soe particular exaples
More informationLecture 9: September 19
36-700: Probability ad Mathematical Statistics I Fall 206 Lecturer: Siva Balakrisha Lecture 9: September 9 9. Review ad Outlie Last class we discussed: Statistical estimatio broadly Pot estimatio Bias-Variace
More informationProbability Theory. Exercise Sheet 4. ETH Zurich HS 2017
ETH Zurich HS 2017 D-MATH, D-PHYS Prof. A.-S. Szita Coordiator Yili Wag Probability Theory Exercise Sheet 4 Exercise 4.1 Let X ) N be a sequece of i.i.d. rado variables i a probability space Ω, A, P ).
More informationNon-Homogeneous Poisson Processes Applied to Count Data: A Bayesian Approach Considering Different Prior Distributions
Joural of Evirometal Protectio, 202, 3, 336-345 http://dx.doi.org/0.4236/jep.202.3052 Published Olie October 202 (http://www.scirp.org/joural/jep) No-Homogeeous Poisso Processes Applied to Cout Data: A
More informationNon-asymptotic sequential confidence regions with fixed sizes for the multivariate nonlinear parameters of regression. Andrey V.
No-asyptotic sequetial cofidece regios with fixed sizes for the ultivariate oliear paraeters of regressio Adrey V Tiofeev Abstract I this paper we cosider a sequetial desig for estiatio of o-liear paraeters
More informationThe Binomial Multi- Section Transformer
4/4/26 The Bioial Multisectio Matchig Trasforer /2 The Bioial Multi- Sectio Trasforer Recall that a ulti-sectio atchig etwork ca be described usig the theory of sall reflectios as: where: ( ω ) = + e +
More informationLast Lecture. Wald Test
Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More information