1 Bayesian Computation.
|
|
- Berenice Evans
- 5 years ago
- Views:
Transcription
1 ISyE8843A, Brai Vidakovic Hadout 9 1 Bayesia Computatio If the selectio of a adequate prior was the major coceptual ad modelig challege of Bayesia aalysis, the major implemetatioal challege is computatio As soo as the model deviates from the cojugate structure, fidig the posterior (first the margial) distributio ad the Bayes rule is all but simple A closed form solutio is more a exceptio tha the rule, ad eve for such closed form solutios, lucky mathematical coicideces, coveiet mixtures, ad other tricks are eeded Up to this poit I believe you got a sese of this calculatioal challege If classical statistics relies o optimizatio, Bayesia statistics relies o itegratio The margial eeded for the posterior is a itegral m(x) = f(x θ)π(θ)dθ, ad the Bayes estimator of h(θ), with respect to the squared error loss is a ratio of itegrals, Θ δ π (x) = h(θ)π(θ x)dθ = h(θ)f(x θ)π(θ)dθ Θ f(x θ)π(θ)dθ Θ Θ The difficulties i calculatig the above Bayes rule are that (i) the posterior caot be represeted i a fiite form, ad (ii) the itegral of h(θ) does ot have a closed form itegral uder the possibly closed form posterior distributio Adoptig a differet loss fuctio usually makes calculatio eve more difficult A exceptio is absolute loss for which the Bayes rule is the mode of the posterior, ad the mode is ot iflueced by ormalizig (trouble makig) costat, m(x) The last two decades of research i Bayesia statistics cotributed to tremedous broadeig of the scope of Bayesia models Models that could ot be hadled before are ow routiely solved This is doe by Markov Chai Mote Carlo (MCMC) Methods, ad their itroductio to the field of statistics revolutioized Bayesia statistics This hadout overviews pre MCMC techiques: Mote Carlo Itegratio, Importace Samplig, ad Aalytic Approximatios (Riema, Laplace, ad Saddlepoit) 11 Bayesia CLT Suppose that 1,,, f(x θ), where θ is p-dimesioal parameter, ad that the prior o θ is π(θ) The prior π(θ) could be improper, but we assume that the posterior is proper ad that its mode exists The, whe, where θ M is posterior mode, ie, a solutio of [θ x] MVN p (θ M, H 1 (θ M )), π (θ x) θ i = 0, i = 1,, p, where π (θ x) = f(x θ)π(θ) is o-ormalized posterior Let H be the Hessia defied as ( π ) (θ x) H(θ) = θ i θ j 1
2 The asymptotic covariace matrix is H 1 (θ M ) = (H(θ)) 1 θ=θm The proof ca be foud i stadard texts o asymptotic theory Example: Beroulli s Assume that 1,, Ber(θ) ad that the prior o θ is 1 Show that θ M =, P H(θ) = i + P i This gives H 1 (θ θ 1 θ M ) = θ M (1 θ M The Bayesia CLT gives expected result, [θ 1,, ] N (θ M, θ M(1 θ M ) ) Example: Poisso/Gamma Let 1,, Poi(θ) ad θ θ α 1 exp{ βθ} The, ( α + i 1 [θ x] N, α + ) i 1 + β ( + β) This follows from the fact that the mode is θ M = α+p i 1 +β ad that H(θ) = α+p i 1 θ Posterior approximatios by usig Bayesia Cetral Limit Theorem are called first order approximatios or modal approximatios Sice the posterior is approximated by ormal distributio, this approximatio may be poor if the true posterior is skewed or if the sample size is small 1 Laplace Approximatio Suppose we are iterested i fidig A f(x θ)dx for a particular value of θ Let f(x θ) be represeted as exp{ h(x θ)} ( Let x θ is the value of x that miimizes h(x θ) (or equivaletly maximizes f(x θ)) If h (x θ θ) = h(x θ), the x )x=x θ b a e h(x θ) dx e h(x θ θ) θ h (x θ θ) [ Φ( h (x θ θ)(b x θ ) Φ( ] h (x θ θ)(a x θ ) Usig Laplace method, approximate Gamma itegral b x α 1 a Γ(α)β e x/β dx, for α = 3, β = 3 ad (a, b) = α (3, 5), (7, 1), ad (5, ) Compare with exact values ad discuss Cosider the posterior expectatio of iterest, E θ x (g(θ)) = Θ g(θ)f(x θ)π(θ)dθ Θ f(x θ)π(θ)dθ = Θ b N(θ) exp{ h N (θ)}dθ Θ b D(θ) exp{ h D (θ)}dθ If h N (θ) = h D (θ), it is said that the represetatio is i stadard form If, o the other had, b N (θ) = b D (θ), it is said that the represetatio is i fully expoetial form If E θ x (g(θ)) ca be writte i the stadard form, α α+β E θ x (g(θ)) = ĝ + σ D ˆ b Dĝ ˆ b D + σ ˆ D g + σ4 ˆ D h ĝ + O( ) For the fully expoetial form, if g is positive ad g(θ D ) is uiformly bouded away from zero, E θ x (g(θ)) = ˆb N σn ˆbD σd exp{ (ĥn ĥd)} + O( ) To illustrate the above formulas, let s fid approximatio to expectatio of Beta B(α, β) distributio,
3 13 Classical Mote Carlo Itegratio Suppose F (x) is a probability distributio, ad h(x) is a measurable fuctio for which Eh() < whe F Let 1,, be a sample from F The This is the same as if we wrote h(x)df (x) = E h() 1 h(x)df (x) P 1( i x) h( i ) h(x)df (x 1,, ), where F (x 1,, ) = is the empirical distributio fuctio For simplicity of otatio, assume that F is cotiuous ad that desity is f, although all results remai valid for geeral probability distributios Sice, by assumptio, Eh() = h(x)f(x)dx is fiite for f, from the strog law of large umbers (SLLN) it follows I = 1 h( i ) as h(x)f(x)dx = I The symbol as stads for almost sure covergece, meaig that the the covergece fails o the set that has probability 0 The speed of this almost sure covergece is measured by the speed of decay of variace of the Mote Carlo approximatios Theoretically, V ar(i ) = I(1 I) Example: Normal Likelihood/Cauchy Prior The followig model is give: θ N (θ, 1) θ Ca(0, 1) The Bayes rule is approximately equal to δ π (x) = R R θ exp{ 1/(x θ) }dθ 1+θ 1 exp{ 1/(x θ) 1+θ }dθ δ π (x) η i 1+ηi 1 1+ηi, where η i N (, 1) Assume that = The high precisio umerical algorithms (up to 0 decimal places) i MATHEMAT- ICA give (a) δ() = , ad (b) P θ (θ 1) = Cosider those exact values ad apply the simulatio to check the performace of Mote Carlo Uiform o -Sphere This is a example of Christia Robert that is a multivariate geeralizatio of the represetatio of symmetric distributios as (scale) mixture of uiforms 3
4 Assume θ MV N p (θ, I), θ c U( θ = c), c Ga(α, β) Assume that for α =, β = 3 ad x = (0, 0, 0, 0, 0, 1) we wat to fid Bayes rule, δ(x) Robert (199, 001) shows that the Bayes rule ca be expressed i almost closed form (up to cofluet hypergeometric fuctios), δ(x) = α p β 1F 1 (α + 1; (p + )/; x /( + 4β)) 1F 1 (α; p/; x, /( + 4β)) where 1 F 1 (a, b; z) = k=1 (a) k/(b) k z k /k!, ad (a) k = a(a+1) (a+k 1) Thus, δ(x) = (0, 0, 0, 0, 0, ) To approximate this rule by MC method, oe may do prior simulatio 1 Geerate c from Ga(, 3) [BayesLab fuctio rad gamma] Simulate M uiform variates θ o 6-dimesioal sphere The polar represetatio of a elemet θ from sphere is: θ 1 = c cos ϕ 1 θ = c si ϕ 1 cos ϕ θ 3 = c si ϕ 1 si ϕ cos ϕ 3 θ p 1 = c si ϕ 1 si ϕ cos ϕ p 1 θ p = c si ϕ 1 si ϕ si ϕ p 1 where 0 ϕ 1,, ϕ p π ad 0 ϕ p 1 π If ϕ-agles are selected uiformly from their respective domais, the θ is uiformly distributed o the sphere of radius c agles = pi*rad(1,p-1); agles(p-1) = *agles(p-1); agles(p)=0; theta(1)=sqrt(c) * cos(agles(1)); for i=:p theta(i)=theta(i-1)*si(agles(i-1))/cos(agles(i-1))*cos(agles(i)); ed 3 Approximate δ by M θ i exp{ x θ i /} M exp{ x θ i /} Ripley s Example Cosider p = dx, the tail of the stadard Cauchy distributio Of course, a π(1+x ) Cauchy radom variable has a explicit cumulative distributio fuctio, F (x) = π arcta(x) ad the above tail probability is p = 1 1 π arcta() = 1 π arcta(1/) = Discuss! 4
5 14 Importace Samplig Importace samplig, or weighted samplig, is a Mote Carlo techique i which the itegral of iterest is trasformed i a coveiet way to ehace the simulatio Suppose f is a desity ad h(x)f(x)dx is of iterest Assume that samplig from f is either difficult or impossible ad direct applicatio of Mote Carlo method is hard The idea of importace samplig is to multiply ad divide the expressio h(x)f(x) by a coveiet desity g(x), h(x)f(x) g(x) g(x) The, h(x)f(x) h(x)f(x)dx = g(x)dx g(x) The coditioally, the desity g is easy to sample from, ad the Mote Carlo approximatio to the itegral is h(x)f(x) g(x)dx 1 g(x) h( i )f( i ) iid, i g (1) g( i ) The desity g is called the importace desity ad its choice depeds o f There are several guidelies to how to select the importace desity g A obvious requiremet is that support of f has to be subset of support of g, supp(f) supp(g) sice otherwise we may have a udefied itegrad Several authors ivestigated the form of importace desity that miimizes the variace of the simulatios Theoretical results are available but the optimal desity requires kowledge of h(x)f(x)dx, the itegral we are approximatig, ad has o practical value However, from the form of the optimal desity g, oe cocludes that for importace desities for which h f/g is almost costat ad has fiite variace, the importace scheme works well A attractive feature of importace samplig is that a sigle radom sample from g ca be used for differet f ad h If the ratio f/g i (1) is kow up to a costat, which may ofte be the case i Bayesia calculatios, the followig approximatio is used, Refereces h(x)f(x) g(x)dx g(x) h( i )f( i ) / g( i ) f( i ) g( i ), i iid g [1] Robert, C (001) Bayesia Choice, Secod Editio, Spriger Verlag Appedix: Simulatio of Radom Numbers The BayesLab c matlab suite cotais radom umber geerators for major discrete ad cotiuous probability laws Here we give a couple of geeral methods that may be of help if the distributio is out of list Theorem 11 (Iverse Trasformatio Method) Let U be the uiform (0,1) radom variable ad F a cdf for which F 1 exists The F 1 (U) is a draw from distributio F 5
6 Expoetial Radom Variates Sice for the expoetial E(λ) distributio y = F (x) = 1 e λx, λ, x 0, the iverse fuctio is x = 1 λ log(1 y) Thus = 1 log(1 U), λ has E(λ) distributio I fact, sice U d = (1 U) oe ca use = 1 log U λ Accept-Reject Method (ARM) This method was origially proposed by vo Neuma Give the desity of iterest (target desity), f, fid proposal desity (evelope desity, istrumetal desity) g such that ( x supp(f)) f(x) Mg(x) The algorithm is: Step 1 Geerate a cadidate g Geerate U U(0, 1) Step Accept Y = if U f() Mg() ; Step 3 Retur to Step 1 Ideed, this geerates Y f The distributio of Y is give by ( P (Y y) = P y U f() ) = Mg() P (U This ratio is, P (Y y) = y f(x)/mg(x) 0 du g(x)dx y 0 du g(x)dx P ( y, U f() Mg() ) f() Mg() ) = 1/M y f(x)dx y 1/M f(x)dx = f(x)dx Sice each proposal will be accepted with probability P (U f() Mg() ) = 1/M, ( success ) the umber of trials ecessary to produce a draw from f is geometric Ge(1/M) The tight boud o M will icrease efficiecy as well as proposal desities with boud M close to 1 Exercises: 1 Usig ARM, geerate from calbe(, 4) usig uiform U(0, 1) proposals Show first that 0x(1 x) 3 is maximized at x = 1/4 ad that M = 135/64 Geeratig radom ormal is of great iterest ad several well established methods exist Here is a ARM versio Take proposals from DE(1), g(x) = 1 e x They are simply expoetials E(1) multiplied by radom sig, ie, S = B 1, where B Ber(1/) Estimate M (M e/π 13155) 3 The desity f(x) = ( cos ( )) πx, 1 x 1, called Bickel-Levit prior is of iterest i some areas of decisio theory (approximatio to the least favorable prior i estimatig a bouded ormal mea) Propose ARM method for simulatig from Bickel-Levit prior 4 Let f(x θ) be DE(θ, 1), ad let the prior distributio for θ is a symmetric two-poit distributio (cocetrated at µ ad µ, µ > 0) 6
7 (a) Fid the margial distributio, m(x) (b) Propose a samplig scheme to draw from m(x) 5 Suppose that Y 1,, Y Ber(θ i ), i = 1,, Suppose that vector of covariates i = ( i1,, ip ) correspod to each Y i, ad that θ i = exp{ i β} 1 + exp{ i β}, where β is the vector of regressio coefficiets (a) Show that the likelihood is { } f(x β) = exp [Y i iβ log(1 + e i β )] (b) Assume π(β) = 1 Show that the posterior mode is the MLE for β, ad (c) Show logπ (β y) β i β j = k=1 exp{ i ki β} kj (1 + exp{ i, 1 i, j p β}) [β Y ] MVN p ( ˆβ mle, ( V ) 1 ), where ( exp{ 1 V = diag ˆβ mle } (1 + exp{ 1 ˆβ mle }),, exp{ ˆβ ) mle } (1 + exp{ ˆβ mle }) 5 Show that correlated radom variables F 1 (U) ad G 1 (U) have maximum positive correlatio ad have the distributios F ad G; for maximal egative correlatio it is eough take G 1 ( U) Uiform o Sphere This MATHEMATICA code gives the exact value of the Bayes rule for x = (0, 0, 0, 0, 0, 1) The code alpha=; beta=3; p=6;x = Table[0,{p}]; x[[6]]=1; delta = * alpha/p 1/( + beta) Hypergeometric1F1[alpha+1, (p+)/, Norm[x]ˆ/( + 4 beta)]/ Hypergeometric1F1[alpha,p/, Norm[x]ˆ/( + 4 beta)] x//n results i δ = {0, 0, 0, 0, 0, } 7
Monte Carlo Integration
Mote Carlo Itegratio I these otes we first review basic umerical itegratio methods (usig Riema approximatio ad the trapezoidal rule) ad their limitatios for evaluatig multidimesioal itegrals. Next we itroduce
More informationExponential Families and Bayesian Inference
Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationChapter 2 The Monte Carlo Method
Chapter 2 The Mote Carlo Method The Mote Carlo Method stads for a broad class of computatioal algorithms that rely o radom sampligs. It is ofte used i physical ad mathematical problems ad is most useful
More informationDistribution of Random Samples & Limit theorems
STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to
More information17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15
17. Joit distributios of extreme order statistics Lehma 5.1; Ferguso 15 I Example 10., we derived the asymptotic distributio of the maximum from a radom sample from a uiform distributio. We did this usig
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationMonte Carlo Methods: Lecture 3 : Importance Sampling
Mote Carlo Methods: Lecture 3 : Importace Samplig Nick Whiteley 16.10.2008 Course material origially by Adam Johase ad Ludger Evers 2007 Overview of this lecture What we have see... Rejectio samplig. This
More informationTopic 10: The Law of Large Numbers
Topic : October 6, 2 If we choose adult Europea males idepedetly ad measure their heights, keepig a ruig average, the at the begiig we might see some larger fluctuatios but as we cotiue to make measuremets,
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013 Large Deviatios for i.i.d. Radom Variables Cotet. Cheroff boud usig expoetial momet geeratig fuctios. Properties of a momet
More informationAAEC/ECON 5126 FINAL EXAM: SOLUTIONS
AAEC/ECON 5126 FINAL EXAM: SOLUTIONS SPRING 2015 / INSTRUCTOR: KLAUS MOELTNER This exam is ope-book, ope-otes, but please work strictly o your ow. Please make sure your ame is o every sheet you re hadig
More informationElement sampling: Part 2
Chapter 4 Elemet samplig: Part 2 4.1 Itroductio We ow cosider uequal probability samplig desigs which is very popular i practice. I the uequal probability samplig, we ca improve the efficiecy of the resultig
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More information4. Partial Sums and the Central Limit Theorem
1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems
More informationCS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5
CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio
More information1 Introduction to reducing variance in Monte Carlo simulations
Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by
More informationBayesian Methods: Introduction to Multi-parameter Models
Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationConvergence of random variables. (telegram style notes) P.J.C. Spreij
Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space
More informationSolutions: Homework 3
Solutios: Homework 3 Suppose that the radom variables Y,...,Y satisfy Y i = x i + " i : i =,..., IID where x,...,x R are fixed values ad ",...," Normal(0, )with R + kow. Fid ˆ = MLE( ). IND Solutio: Observe
More informationClases 7-8: Métodos de reducción de varianza en Monte Carlo *
Clases 7-8: Métodos de reducció de variaza e Mote Carlo * 9 de septiembre de 27 Ídice. Variace reductio 2. Atithetic variates 2 2.. Example: Uiform radom variables................ 3 2.2. Example: Tail
More informationLet us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.
Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,
More informationEECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1
EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationProbability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
More informationAsymptotics. Hypothesis Testing UMP. Asymptotic Tests and p-values
of the secod half Biostatistics 6 - Statistical Iferece Lecture 6 Fial Exam & Practice Problems for the Fial Hyu Mi Kag Apil 3rd, 3 Hyu Mi Kag Biostatistics 6 - Lecture 6 Apil 3rd, 3 / 3 Rao-Blackwell
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More informationCS284A: Representations and Algorithms in Molecular Biology
CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationStochastic Simulation
Stochastic Simulatio 1 Itroductio Readig Assigmet: Read Chapter 1 of text. We shall itroduce may of the key issues to be discussed i this course via a couple of model problems. Model Problem 1 (Jackso
More informationThis exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.
Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the
More informationLarge Sample Theory. Convergence. Central Limit Theorems Asymptotic Distribution Delta Method. Convergence in Probability Convergence in Distribution
Large Sample Theory Covergece Covergece i Probability Covergece i Distributio Cetral Limit Theorems Asymptotic Distributio Delta Method Covergece i Probability A sequece of radom scalars {z } = (z 1,z,
More informationIntroduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT
Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory
More informationSAMPLING LIPSCHITZ CONTINUOUS DENSITIES. 1. Introduction
SAMPLING LIPSCHITZ CONTINUOUS DENSITIES OLIVIER BINETTE Abstract. A simple ad efficiet algorithm for geeratig radom variates from the class of Lipschitz cotiuous desities is described. A MatLab implemetatio
More informationof the matrix is =-85, so it is not positive definite. Thus, the first
BOSTON COLLEGE Departmet of Ecoomics EC771: Ecoometrics Sprig 4 Prof. Baum, Ms. Uysal Solutio Key for Problem Set 1 1. Are the followig quadratic forms positive for all values of x? (a) y = x 1 8x 1 x
More informationSample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.
ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally
More informationStudy the bias (due to the nite dimensional approximation) and variance of the estimators
2 Series Methods 2. Geeral Approach A model has parameters (; ) where is ite-dimesioal ad is oparametric. (Sometimes, there is o :) We will focus o regressio. The fuctio is approximated by a series a ite
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationOutput Analysis and Run-Length Control
IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%
More informationLecture 9: September 19
36-700: Probability ad Mathematical Statistics I Fall 206 Lecturer: Siva Balakrisha Lecture 9: September 9 9. Review ad Outlie Last class we discussed: Statistical estimatio broadly Pot estimatio Bias-Variace
More informationUnbiased Estimation. February 7-12, 2008
Ubiased Estimatio February 7-2, 2008 We begi with a sample X = (X,..., X ) of radom variables chose accordig to oe of a family of probabilities P θ where θ is elemet from the parameter space Θ. For radom
More information62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +
62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of
More informationThe Central Limit Theorem
Chapter The Cetral Limit Theorem Deote by Z the stadard ormal radom variable with desity 2π e x2 /2. Lemma.. Ee itz = e t2 /2 Proof. We use the same calculatio as for the momet geeratig fuctio: exp(itx
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More information1.010 Uncertainty in Engineering Fall 2008
MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval
More informationJanuary 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS
Jauary 25, 207 INTRODUCTION TO MATHEMATICAL STATISTICS Abstract. A basic itroductio to statistics assumig kowledge of probability theory.. Probability I a typical udergraduate problem i probability, we
More informationLECTURE 8: ASYMPTOTICS I
LECTURE 8: ASYMPTOTICS I We are iterested i the properties of estimators as. Cosider a sequece of radom variables {, X 1}. N. M. Kiefer, Corell Uiversity, Ecoomics 60 1 Defiitio: (Weak covergece) A sequece
More informationA Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution
A Note o Box-Cox Quatile Regressio Estimatio of the Parameters of the Geeralized Pareto Distributio JM va Zyl Abstract: Makig use of the quatile equatio, Box-Cox regressio ad Laplace distributed disturbaces,
More information3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,
3. Z Trasform Referece: Etire Chapter 3 of text. Recall that the Fourier trasform (FT) of a DT sigal x [ ] is ω ( ) [ ] X e = j jω k = xe I order for the FT to exist i the fiite magitude sese, S = x [
More informationStatistical Pattern Recognition
Statistical Patter Recogitio Classificatio: No-Parametric Modelig Hamid R. Rabiee Jafar Muhammadi Sprig 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Ageda Parametric Modelig No-Parametric Modelig
More informationTAMS24: Notations and Formulas
TAMS4: Notatios ad Formulas Basic otatios ad defiitios X: radom variable stokastiska variabel Mea Vätevärde: µ = X = by Xiagfeg Yag kpx k, if X is discrete, xf Xxdx, if X is cotiuous Variace Varias: =
More informationLecture 3. Properties of Summary Statistics: Sampling Distribution
Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary
More informationBinomial Distribution
0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible
More informationThe Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution
Iteratioal Mathematical Forum, Vol. 8, 2013, o. 26, 1263-1277 HIKARI Ltd, www.m-hikari.com http://d.doi.org/10.12988/imf.2013.3475 The Samplig Distributio of the Maimum Likelihood Estimators for the Parameters
More informationEFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS
EFFECTIVE WLLN, SLLN, AND CLT IN STATISTICAL MODELS Ryszard Zieliński Ist Math Polish Acad Sc POBox 21, 00-956 Warszawa 10, Polad e-mail: rziel@impagovpl ABSTRACT Weak laws of large umbers (W LLN), strog
More informationProbability and MLE.
10-701 Probability ad MLE http://www.cs.cmu.edu/~pradeepr/701 (brief) itro to probability Basic otatios Radom variable - referrig to a elemet / evet whose status is ukow: A = it will rai tomorrow Domai
More informationChapter 6 Infinite Series
Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat
More informationMonte Carlo method and application to random processes
Mote Carlo method ad applicatio to radom processes Lecture 3: Variace reductio techiques (8/3/2017) 1 Lecturer: Eresto Mordecki, Facultad de Ciecias, Uiversidad de la República, Motevideo, Uruguay Graduate
More informationCSE 527, Additional notes on MLE & EM
CSE 57 Lecture Notes: MLE & EM CSE 57, Additioal otes o MLE & EM Based o earlier otes by C. Grat & M. Narasimha Itroductio Last lecture we bega a examiatio of model based clusterig. This lecture will be
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More informationThe Bayesian Learning Framework. Back to Maximum Likelihood. Naïve Bayes. Simple Example: Coin Tosses. Given a generative model
Back to Maximum Likelihood Give a geerative model f (x, y = k) =π k f k (x) Usig a geerative modellig approach, we assume a parametric form for f k (x) =f (x; k ) ad compute the MLE θ of θ =(π k, k ) k=
More informationChapter 10: Power Series
Chapter : Power Series 57 Chapter Overview: Power Series The reaso series are part of a Calculus course is that there are fuctios which caot be itegrated. All power series, though, ca be itegrated because
More informationStatistical Theory MT 2009 Problems 1: Solution sketches
Statistical Theory MT 009 Problems : Solutio sketches. Which of the followig desities are withi a expoetial family? Explai your reasoig. (a) Let 0 < θ < ad put f(x, θ) = ( θ)θ x ; x = 0,,,... (b) (c) where
More informationLecture 33: Bootstrap
Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece
More informationLast Lecture. Biostatistics Statistical Inference Lecture 16 Evaluation of Bayes Estimator. Recap - Example. Recap - Bayes Estimator
Last Lecture Biostatistics 60 - Statistical Iferece Lecture 16 Evaluatio of Bayes Estimator Hyu Mi Kag March 14th, 013 What is a Bayes Estimator? Is a Bayes Estimator the best ubiased estimator? Compared
More informationSince X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain
Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS
MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak
More information5 : Exponential Family and Generalized Linear Models
0-708: Probabilistic Graphical Models 0-708, Sprig 206 5 : Expoetial Family ad Geeralized Liear Models Lecturer: Matthew Gormley Scribes: Yua Li, Yichog Xu, Silu Wag Expoetial Family Probability desity
More informationThis section is optional.
4 Momet Geeratig Fuctios* This sectio is optioal. The momet geeratig fuctio g : R R of a radom variable X is defied as g(t) = E[e tx ]. Propositio 1. We have g () (0) = E[X ] for = 1, 2,... Proof. Therefore
More informationStatistical Theory MT 2008 Problems 1: Solution sketches
Statistical Theory MT 008 Problems : Solutio sketches. Which of the followig desities are withi a expoetial family? Explai your reasoig. a) Let 0 < θ < ad put fx, θ) = θ)θ x ; x = 0,,,... b) c) where α
More informationSequences and Series of Functions
Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges
More informationPRELIM PROBLEM SOLUTIONS
PRELIM PROBLEM SOLUTIONS THE GRAD STUDENTS + KEN Cotets. Complex Aalysis Practice Problems 2. 2. Real Aalysis Practice Problems 2. 4 3. Algebra Practice Problems 2. 8. Complex Aalysis Practice Problems
More informationECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015
ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationInfinite Sequences and Series
Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More information6. Sufficient, Complete, and Ancillary Statistics
Sufficiet, Complete ad Acillary Statistics http://www.math.uah.edu/stat/poit/sufficiet.xhtml 1 of 7 7/16/2009 6:13 AM Virtual Laboratories > 7. Poit Estimatio > 1 2 3 4 5 6 6. Sufficiet, Complete, ad Acillary
More informationSDS 321: Introduction to Probability and Statistics
SDS 321: Itroductio to Probability ad Statistics Lecture 23: Cotiuous radom variables- Iequalities, CLT Puramrita Sarkar Departmet of Statistics ad Data Sciece The Uiversity of Texas at Austi www.cs.cmu.edu/
More informationMa 4121: Introduction to Lebesgue Integration Solutions to Homework Assignment 5
Ma 42: Itroductio to Lebesgue Itegratio Solutios to Homework Assigmet 5 Prof. Wickerhauser Due Thursday, April th, 23 Please retur your solutios to the istructor by the ed of class o the due date. You
More informationLecture Notes 15 Hypothesis Testing (Chapter 10)
1 Itroductio Lecture Notes 15 Hypothesis Testig Chapter 10) Let X 1,..., X p θ x). Suppose we we wat to kow if θ = θ 0 or ot, where θ 0 is a specific value of θ. For example, if we are flippig a coi, we
More informationG. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan
Deviatio of the Variaces of Classical Estimators ad Negative Iteger Momet Estimator from Miimum Variace Boud with Referece to Maxwell Distributio G. R. Pasha Departmet of Statistics Bahauddi Zakariya Uiversity
More informationComparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series
Applied Mathematical Scieces, Vol. 7, 03, o. 6, 3-337 HIKARI Ltd, www.m-hikari.com http://d.doi.org/0.988/ams.03.3430 Compariso Study of Series Approimatio ad Covergece betwee Chebyshev ad Legedre Series
More informationStat410 Probability and Statistics II (F16)
Some Basic Cocepts of Statistical Iferece (Sec 5.) Suppose we have a rv X that has a pdf/pmf deoted by f(x; θ) or p(x; θ), where θ is called the parameter. I previous lectures, we focus o probability problems
More informationSequences. Notation. Convergence of a Sequence
Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it
More information4.5 Multiple Imputation
45 ultiple Imputatio Itroductio Assume a parametric model: y fy x; θ We are iterested i makig iferece about θ I Bayesia approach, we wat to make iferece about θ from fθ x, y = πθfy x, θ πθfy x, θdθ where
More informationAdvanced Stochastic Processes.
Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.
More information7.1 Convergence of sequences of random variables
Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite
More informationChapter 13: Tests of Hypothesis Section 13.1 Introduction
Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed
More informationNANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS
NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.
More informationSeries III. Chapter Alternating Series
Chapter 9 Series III With the exceptio of the Null Sequece Test, all the tests for series covergece ad divergece that we have cosidered so far have dealt oly with series of oegative terms. Series with
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More information1 Approximating Integrals using Taylor Polynomials
Seughee Ye Ma 8: Week 7 Nov Week 7 Summary This week, we will lear how we ca approximate itegrals usig Taylor series ad umerical methods. Topics Page Approximatig Itegrals usig Taylor Polyomials. Defiitios................................................
More informationExercise 4.3 Use the Continuity Theorem to prove the Cramér-Wold Theorem, Theorem. (1) φ a X(1).
Assigmet 7 Exercise 4.3 Use the Cotiuity Theorem to prove the Cramér-Wold Theorem, Theorem 4.12. Hit: a X d a X implies that φ a X (1) φ a X(1). Sketch of solutio: As we poited out i class, the oly tricky
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationKinetics of Complex Reactions
Kietics of Complex Reactios by Flick Colema Departmet of Chemistry Wellesley College Wellesley MA 28 wcolema@wellesley.edu Copyright Flick Colema 996. All rights reserved. You are welcome to use this documet
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationAsymptotic Results for the Linear Regression Model
Asymptotic Results for the Liear Regressio Model C. Fli November 29, 2000 1. Asymptotic Results uder Classical Assumptios The followig results apply to the liear regressio model y = Xβ + ε, where X is
More information